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A class  of  affine-invariant  signed-rank  test  statistics  is  proposed  for  the  one- 
sample  multivariate  location  problem.  This  class  is  based  on  the  transformation- 
retransformation  approach  and  a directional  transformation.  It  contains  several 
test  statistics  that  have  been  previously  proposed  for  this  problem.  For  the  class  of 
symmetric  distributions,  the  asymptotic  null  distribution  of  the  proposed  class  of 
statistics  is  shown  to  be  chi-square  with  p degrees  of  freedom,  where  p represents 
the  dimension  of  the  observed  vectors.  In  addition,  the  asymptotic  distribution 
of  the  test  statistic,  under  a sequence  of  alternatives  approaching  the  null,  is 
obtained.  The  Pitman  asymptotic  relative  efficiency  of  this  class  with  respect 
to  Hotelling’s  is  established.  One  particular  member  of  this  class  of  tests  is 
recommended.  This  specific  statistic  is  obtained  by  using  a linear  function  of 
the  ranks  of  Euclidean  norms  of  the  transformed  vectors.  Comparisons  of  this 


Vll 


new  test  statistic  and  several  leading  competitors  are  made  through  efficiency 
calculations  and  Monte  Carlo  studies.  The  new  statistic  has  strong  efficiencies 
over  a wide  spectrum  of  distributions,  ranging  from  very  light-tailed  distributions 
to  very  heavy-tailed  ones.  It  performs  as  well  as  and  in  many  cases  better  than 
its  competitors.  Yet,  the  best  feature  of  this  statistic  is  that  it  is  very  easy  to 
compute  for  data  in  any  practical  dimension.  This  distinguishes  it  from  the  other 
affine-invariant  signed-rank  tests  in  the  literature. 

This  new  signed-rank  test  is  also  applied  to  a simple  repeated-measures 
setting.  Since  the  test  is  affine-invariant,  it  is  not  affected  by  the  structure  of 
the  variance-covariance  matrix  of  the  underlying  population,  as  is  the  case  with 
some  procedures  in  the  literature.  Once  again,  the  particular  signed-rank  statistic 
developed  in  this  dissertation  showed  strong  efficiencies  and  performed  very  well  in 
comparison  with  other  well-known  procedures  over  a wide  variety  of  distributional 
assumptions. 
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CHAPTER  1 
INTRODUCTION 


This  dissertation  investigates  test  statistics  for  the  one-sample  multivariate 
location  problem.  A new  multivariate  signed-rank  procedure  is  proposed  and 
its  application  to  certain  repeated-measures  designs  is  studied.  The  one-sample 
problem  is  introduced  in  this  chapter  while  the  repeated-measures  model  is 
presented  in  Chapter  4. 

For  the  one-sample  location  problem,  assume  that  Yi,  • • • , Y„  are  inde- 
pendently and  identically  distributed  (iid)  as  Y,  where  Y=  (Yi,  • • • ,Yp)'  has 
an  absolutely  continuous  p-dimensional  distribution  with  location  parameter 
6 = ••  • , $p)'.  We  wish  to  test 

Ho  : 0 = 0 against  Ha  ■ 0 ^ 0.  (1.1) 


The  value  0 is  used  without  loss  of  generality,  since  if  the  null  was  0 = 0q,  the  test 
of  (1.1)  would  be  performed  on  the  random  variables  Yi  — • • • > Y„  — 0q. 

Probably  the  best  known  parametric  procedure  for  this  setting  is  Hotelling’s 
(1931)  T^,  which  is  based  on  the  statistic 


T2  ^ • • • , Y„)  = nY'(S)-'Y, 


where 


1 " 

Y = iyY 

''‘ti 


and 


S = ^E(Y,-Y)(Y,-Y)'. 

i=l 


Under  Hg,  if  S,  the  variance-covariance  matrix  of  Y,  is  positive  definite,  then 
has  an  asymptotic  chi-square  distribution  with  p-degrees  of  freedom  (xp-  When 


1 


2 


Y has  a distribution  that  is  p-variate  normal,  is  the  likelihood  ratio  test  of 
(1.1)  and  the  null  distribution  of  becomes  a multiple  of  an  F-distribution.  In 
particular,  Hq  is  rejected  in  favor  of  if  where  F„i,,,2(q!)  is 

the  upper  ath  quantile  of  the  F-distribution  with  % numerator  degrees  of  freedom 
and  r]2  denominator  degrees  of  freedom.  This  procedure  is  invariant  with  respect 
to  nonsingular  linear  transformations  of  the  observations.  That  is,  the  test  statistic 
satisfies: 

T"(yi,  ■ ■ ■ , y„)  = T"(Dy„  ■ ■ . , Dy„)  (1.2) 


for  every  (yi,  • • • , y„)  and  all  nonsingular  p x p matrices  D.  This  property  ensures 
that  the  value  of  the  test  statistic  remains  unchanged  following  rotations  of  the 
observations  about  the  origin,  reflections  of  the  observations  about  a (p  — 1)- 
dimensional  hyper-plane  through  the  origin,  or  changes  in  scale.  This  desirable 
invariance  property  will  be  called  affine-invariance. 

Early  competitors  of  Hotelling’s  were  restricted  to  the  bivariate  case 
(p  = 2).  Hodges  (1955)  and  Blumen  (1958)  proposed  bivariate  aflfine-invariant  sign 
tests  that  are  distribution-free  for  the  class  of  distributions  that  are  directionally 
symmetric  (i.e.,  under  i/o,  = “]|^)  where  ||  • ||  represents  the  Euclidean 
norm).  Blumen  showed  that  under  the  null  hypothesis  his  test  has  an  asymptotic 
chi-square  distribution  with  2 degrees  of  freedom.  Joffe  and  Klotz  (1962)  gave 
expressions  for  the  exact  and  asymptotic  null  distributions  of  Hodges’  test.  They 
also  calculated  the  Bahadur  approximate  efficiency  of  Hodges’  test  with  respect 
to  Hotelling’s  when  the  underlying  distribution  of  the  Y’s  is  the  bivariate 
normal  distribution.  For  8 < n < 12,  Koltz  (1964)  computed  exact  powers  of 
the  Blumen  and  Hodges’  tests  under  normal  location  alternatives  and  efficiencies 
of  Hodges’  test  relative  to  Hotelling’s  T^.  Bennett  (1962  and  1964)  proposed  two 
new  tests.  The  first  procedure  is  based  on  the  frequencies  associated  with  the 
signs  of  the  components.  The  second  procedure  is  a quadratic  form  based  on  the 
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component-wise  Wilcoxon’s  signed-rank  statistic.  Both  tests  have  a asymptotic 
null  distribution.  Bennett’s  tests  are  not  affine-invariant.  Chatterjee  (1966)  also 
proposed  a bivariate  distribution-free  test  that  has  a xl  limiting  null  distribution. 
Killeen  and  Hettmansperger  (1972)  proposed  a new  bivariate  signed  rank  test 
that  is  also  a generalization  of  Wilcoxon’s  signed-rank  test.  They  made  an  exact 
Bahadur  efficiency  comparison  of  Hotelling’s  with  respect  to  the  new  test, 
Blumen’s  test  and  Hodges’s  test.  Brown  and  Hettmansperger  (1987)  proposed 
an  affine-invariant  bivariate  signed-rank  test  based  on  the  median  of  Oja  (1983). 
Brown  and  Hettmansperger  (1989)  also  developed  an  affine-invariant  bivariate 
version  of  the  sign  test. 

For  the  more  general  case,  p > 1,  many  nonparametric  analogs  of  Hotelling’s 
have  been  proposed.  Bickel  (1965)  proposed  using  quadratic  forms  involving 
coordinate-wise  sign  and  Wilcoxon  signed-rank  statistic  respectively.  Sen  and  Puri 
(1967)  suggested  a more  general  approach  by  forming 

n 

Bj  = ^2 

i=l 

where  Eij  is  a general  score  function  applied  to  the  rank  of  among  |Yij|,  • • • , 
and  Cij  = -f  1 or  -1  according  to  whether  Yij  > 0 or  < 0 respectively.  Then  the  test 
is  based  on 

Sn  = -B(W-i)B', 
n 

where  B = (Bi,  • • • , Bp)  and  AV  = ^jj'  ~ BijEijiCijCiji . The 

limiting  distribution  of  Sn,  under  the  null  hypothesis,  is  xl-  Unlike  Hotelling’s 
T^,  these  tests  are  not  affine-invariant.  Thus,  their  power  and  efficiency  depend 
on  the  direction  of  the  shift  of  6 from  the  origin  and  the  variance-covariance 
matrix.  Dietz  (1982)  tried  to  create  an  affine-equivariant  test  procedure  for  the 
bivariate  case  by  means  of  a three-step  procedure.  First,  a certain  transformation 
is  applied  to  the  observations.  Second,  new  coordinate  axes  are  chosen.  Third, 
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Bickel’s  coordinate-wise  sign  and  signed-rank  tests  are  applied  on  the  transformed 
and  rotated  observations.  This  multi-step  process  was  created  to  achieve  affine- 
invariance  with  a test  procedure  based  on  coordinate-wise  statistics.  Dietz  (1984) 
extended  those  results  to  general  linear  rank  tests  for  the  multivariate  location 
problem.  In  this  extension,  step  two  is  dropped  and  the  resulting  statistics  are  no 
longer  invariant  under  nonsingular  linear  transformations.  However,  for  elliptically 
symmetric  contiguous  alternatives,  the  asymptotic  efficiencies  of  the  Dietz  test  are 
independent  of  the  direction  of  shift  and  the  underlying  variance-covariance  matrix; 
moreover,  its  efficiencies  are  equal  to  those  of  the  corresponding  univariate  linear 
signed-rank  tests. 

Many  affine-invariant  nonparametric  multivariate  procedures  have  been 
proposed  for  this  problem.  We  will  mention  most  of  the  tests  with  a detailed 
discussion  for  tests  which  are  extensions  of  the  univariate  Wilcoxon  signed  rank 
test. 

Randles  (1989)  introduced  a multivariate  affine-invariant  sign  test  based  on 
counts  called  interdirections.  Randles’  test  includes  both  the  univariate  sign  test 
and  Blumen’s  bivariate  sign  test  as  special  cases.  It  has  small-sample  distribution 
free  property  and  if  the  observations  are  from  distributions  with  elliptical  directions 
(see  appendix  B for  definitions  of  various  classes  of  distributions  and  relationships 
between  them),  then  the  null  limiting  distribution  of  the  test  is  xl-  This  test 
performs  well  compared  with  Hotelling’s  for  heavy-tailed  distributions  but  not 
as  well  for  light-tailed  distributions.  Because  of  the  complicated  nature  of  these 
counts,  the  test’s  calculation  becomes  more  tedious  when  the  dimension  and/or  the 
sample  size  increase. 

Hettmansperger,  Nyblom  and  Oja  (1994)  extended  the  Brown  and  Hettmansperger 
bivariate  sign  test  to  the  general  multivariate  case.  For  elliptical  distributions,  the 
Pitman  efficiencies  of  their  test  statistic  are  equivalent  to  those  of  Randles’  (1989) 
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sign  test.  Hettmansperger,  Mott5nen  and  Oja  (1997)  suggested  an  affine-invariant 
signed-rank  test  based  on  the  generalized  median  of  Oja.  Their  work  extends  some 
of  the  bivariate  rank  methods  discussed  by  Brown  and  Hettmansperger  (1987). 
They  assumed  that  the  Y s come  from  a p- variate  continuous  distribution  that  is 
symmetric  around  0.  To  be  able  to  describe  their  test  statistic,  let 

Vk(0)  = Vk(yfci,---  ,yfcp,0)  = ^abs 

P- 

denote  the  volume  of  the  simplex  determined  by  y^j,  • • • , y^p  along  with  0.  Here 
abs{}  indicates  the  absolute  value.  Let  K be  the  set  of  all  the  Nk  = {'!)  different 
p-tuples  chosen  from  the  integers  1,  • • • ,n.  Also,  k = (fci,  • • • , fcp)  G A"  represents 
a p-subset  from  the  original  observations.  Note  that  Oja’s  median  (1983)  is  the 
vector  0 that  minimizes  Dn{0)  = Ek€KVk(«).  Hettmansperger,  Mottonen 
and  Oja  defined  the  “rank”  of  yj  as 

keK 


where 

Qk(yi)  =p!Wk(yi) 


and  VVk(y)  is  the  gradient  of  Vk(0)  evaluated  at  0 = y.  They  showed  that 
R,j(0)  = 0 and  that  a sign  test  oi  Ho  : 0 = 0 can  be  based  on  a quadratic  form  in 
R„(0).  The  signed- rank  of  yj  is  obtained  in  the  following  manner.  Let  A be  the  set 
of  all  2^  possible  vectors  (±1,  • • • , ±1)  and  define 

Qj(yi)=2-'5]p!VV.j,(yi). 

, Op)  and  Vak(y)  = Vk(aiyfci,  • • • , Opyfcp,y).  The  signed-rank  of 


R-n  (yi)  = 

keK 


where  a = (oi, 
Yi  is  then 


• • • 
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and  the  signed-rank  statistic  is 

n 

T = J^Rj(yi). 

i=l 

Under  the  assumption  that  the  observations  are  from  a distribution  that  is  sym- 
metric around  0,  the  null  hypothesis  in  (1.1)  is  rejected  if  n“^T'B~^T  > Xp(o:), 
where  B = n ^ Z)r=i  (yi)I^n  (y»)  xliol)  is  the  upper  a quantile  of  a 
chi-square  distribution  with  p degrees  of  freedom.  Mottonen  et  al.  (1998)  gave 
a formula  for  calculating  the  Asymptotic  Relative  Efficiency  (ARE)  of  this  test 
statistic  relative  to  Hotelling’s  T^.  Their  results  showed  that  the  test  statistic  is 
more  efficient  than  Hotelling’s  when  the  underlying  distribution  of  the  Y’s  is 
the  multivariate  t-distribution  with  small  degrees  of  freedom  and  as  the  degrees  of 
freedom  increases,  eventually  to  oo,  the  test  statistic  performs  very  well  as  com- 
pared to  Hotelling’s  T^.  A drawback  of  this  test  is  that,  as  the  dimension  or  the 
sample  size  increases,  it  becomes  very  difficult  to  calculate. 

Peters  and  Randles  (1990)  and  Jan  and  Randles  (1994)  developed  an  affine- 
invariant  signed- rank  test  and  signed  sum  test,  respectively.  Both  procedures 
are  modifications  of  Randles  (1989)  multivariate  sign  test  based  on  the  notion  of 
interdirection.  The  interdirection  Cjk  of  two  observations  Yj  and  Yfc  is  defined  as 
the  number  of  hyper-planes  formed  by  the  origin  and  p — 1 observations  (excluding 
Yj  and  Y*)  such  that  Yj  and  Yfe  are  on  opposite  sides  of  the  hyperplane.  Peters 
and  Randles  assumed  that  the  observations  come  from  an  elliptically  symmetric 
distribution  and  thus  it  was  natural  to  rank  the  observations  according  to  their 
distances  from  the  origin  measured  in  terms  of  elliptical  contours.  Using  this  idea 
of  ranking,  Peters  and  Randles  proposed  the  statistic 


n n 


COs{7TPjk) 


Rj  Rk 


j=i  k=i 


n n 


(1.3) 


7 


where  Ri  denotes  the  rank  of  A = YiE“^Yi  among  Y^E'^Yi,  • • • , Y^S“^Y„, 


t = n-^Y,  YiY;. 

i=l 


Pik  = < 


{Cjk  + dn)/ (p”i)  if  j 7^  A: 


0 


if  3 = k, 


and  dji  = (l/2){(p”^)  — They  proved  that  if  the  Ys  are  iid  from  an 

elliptically  symmetric  density  with  var(Y)=E  and  — S)  = Op(l),  then 

Wn  has  a limiting  xl  distribution  under  the  null  hypothesis.  Peters  and  Randles 
showed  through  ARE  calculations  and  a Monte  Carlo  study  that  Wn  performs 
better  than  Hotelling’s  for  both  light-tailed  and  heavy-tailed  distributions  and 
virtually  as  well  &s  Hotelling’s  under  the  multivariate  normal  distribution  when 
p = 2 or  3.  However,  as  the  dimension  increases,  not  only  does  VPn’s  computation 
become  difficult  but  also  its  performance  deteriorates,  especially  in  the  light-tailed 
cases  and  the  normal  case.  In  fact  the  Pitman  efficiency  of  this  test  relative  to 
Hotelling’s  goes  to  .75  as  p — oo  for  distribution  in  the  power  family  (see  Peters 
and  Randles,  1990,  p.553).  The  power  family  of  distributions  will  be  defined  in 
more  details  in  Chapter  3. 

The  signed  sum  test  of  Jan  and  Randles  results  from  applying  Randles  (1989) 
sign  test  to  the  sums  of  pairs  of  observations.  Thus,  the  statistic  has  the  form 


4p 

n(n  -I- 1)2 


/ I cos  {TVPik,i'k> ) , 

i<k  %'<k' 


where 


/ 


V 


if  (i,  k)  ^ {i',  k') 


if  (i,k)  = {i',k'), 
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and  Cik,i'k'  is  the  number  of  hyper-planes  formed  by  the  origin  0 and  other  p — 1 
observations  (excluding  Yj,  Y^,  Y*/,  and  Yk>)  such  that  Y^  -I-  Yjt  and  Yj/  +Yk>  are 
on  opposite  sides  of  the  hyper-plane  formed.  This  statistic  has  a limiting  null 
distribution  when  the  Y s come  from  an  elliptically  symmetric  distribution  and  a 
better  efficiency  than  Hotelling’s  only  under  heavy-tailed  distributions.  While 
its  Pitman  efficiencies  do  not  deteriorate  as  the  dimension  increases  as  was  the  case 
for  the  test  by  Peters  and  Randles,  its  calculation  rapidly  becomes  quite  difficult 
when  the  dimension  and/or  the  sample  size  increases. 

Hossjer  and  Croux  (1995)  developed  a method  to  generalize  signed-rank 
statistics  to  dimensions  higher  than  one.  When  the  underlying  distribution  of  the 
Ys  is  elliptically  symmetric,  they  suggested  transforming  the  observations  using 
an  equivariant  estimate  of  the  population  covariance  matrix,  say  that  is  based 
on  the  observations  and  then  calculating  a test  statistic  using  the  transformed 
observations.  They  based  their  test  on  the  statistic 


n 


i=l 


where,  a„(l),  • • • , a„(n)  are  non-negative  scores  satisfying  some  regularity  condi- 


,-1/2 


1/2. 


,-1/2. 


tions  and  R(||S„  Yj||)  is  the  rank  of  ||E„  Yi||  among  ||S,j  Yi 

^ 

They  suggested  using  S = S aa  an  estimator  of  the  variance-covariance  matrix  or 
a high  break  down  estimator  from  the  S-estimator  class.  In  their  paper,  they  also 
gave  formulas  for  calculating  the  asymptotic  relative  efficiencies  for  these  gener- 
alized tests  with  respect  to  Hotelling’s  T^.  They  demonstrated  that  the  efficiency 
depends  to  a large  extent  on  the  choice  of  the  scores.  For  that  matter,  they  showed 
that  for  the  multivariate  normal  distribution  the  efficiency  of  the  sign  scores  (i.e. 
a„  = 1)  approaches  one  as  the  dimension  increases  while  the  efficiency  for  the 
Wilcoxon  scores  (i.e.  a„(z)  = i)  decreases  to  .75.  Note  that,  although  Hossjer 
and  Croux  gave  this  generalization  that  is  simple  to  apply,  they  did  not  suggest 
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a “general”  score  function  that  can  maintain  a high  efficiency  regardless  of  the 
dimension.  What  is  meant  by  general  is  that  it  works  well  for  all  elliptically  sym- 
metric distributions.  In  particular,  their  multivariate  extension  of  the  univariate 
signed-rank  test  (a„(i)  = i)  has  efficiencies  that  deteriorate  to  .75  (like  those  of 
Peters  and  Randles)  for  any  distribution  in  the  power  family  class. 

Recently,  Randles  (2000)  proposed  a new  sign  test  which  has  very  desir- 
able characteristics.  It  uses  the  transformation-retransformation  approach  of 
Chakraborty,  Chaudhuri,  and  Oja  (1998)  together  with  a directional  transformation 
due  to  Tyler  (1987).  This  new  multivariate  sign  test  is  practical  to  apply  to  data  of 
any  dimension,  makes  minimal  assumptions  about  the  underlying  distribution  and 
has  a small  sample  distribution-free  property  over  a broad  class  of  distributions. 

In  Chapter  2,  Randles  (2000)  sign  test  is  described  and,  based  on  it,  a new 
affine-invariant  multivariate  class  of  signed-rank  tests  is  proposed.  This  new  test 
makes  minimal  assumptions  about  the  underlying  population  and  is  much  easier 
to  compute  than  any  of  the  others  in  the  literature  (except  possibly  the  test  by 
Hossjer  and  Croux)  for  data  in  any  practical  dimension.  It’s  affine-invariance 
property  as  well  as  its  null  limiting  distribution  are  established.  The  asymptotic 
distribution  of  the  new  test,  under  Pitman  alternatives,  is  investigated  in  Chapter 
3,  which  also  includes  the  comparisons  of  a particular  member  of  this  class  with 
several  one-sample  multivariate  procedures  via  asymptotic  relative  efficiencies  and 
Monte-Carlo  results.  In  particular,  it  is  demonstrated  that  this  new  multivariate 
extension  of  the  univariate  signed-rank  test  has  very  desirable  Pitman  efficiencies 
and  that  these  efficiencies  do  not  deteriorate  for  p -4  oo,  as  was  the  case  for  the 
Peters-Randles  and  Hossjer-Croux  extensions.  The  repeated  measures  problem 
setting  is  introduced  in  Chapter  4 along  with  a literature  review  of  procedures  used 
with  such  a design.  The  application  of  the  new  signed-rank  test  and  Randles  (2000) 
sign  test  to  this  problem  is  investigated  in  Chapter  5.  Finally,  several  procedures 
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CHAPTER  2 

A MULTIVARIATE  ONE-SAMPLE  TEST  FOR  LOCATION 


2.1  Definition  of  the  Test  Statistic 

Since  the  new  signed-rank  test  statistic  is  a modification  of  Randles  (2000) 
multivariate  sign  test,  it  is  important  to  describe  this  sign  test  in  more  detail  than 
the  others  described  in  Chapter  1. 

Randles  (2000)  suggests  using  the  transformation-retransformation  approach 
where  each  observation  Yj  is  transformed  by  a data-determined  nonsingular  matrix 
A = Ay,  to  form  AYj.  Unit  vectors  are  created  from  the  transformed  data,  via 

A 

Vj  = TTTvV)  where  ||  • ||  is  the  Euclidean  norm.  The  test  statistic  is  then  a quadratic 

||A  I ill 

form  in  the  mean  unit  vector,  i.e.. 


n 


S„  = nVV'y'ViV;.)-'V. 


(2.1) 


i=l 


^ 

In  his  paper,  Randles  chooses  the  matrix  A = A^  that  was  proposed  by  Tyler 


(1987),  to  satisfy: 


1 " 

-E 


AdYj 

AdYi 


AaY, 

AdYi 


= 

P 


(2.2) 


A 

Beside  being  nonsingular,  A^  is  also  invariant  under  sign  changes  among  the  yj’s. 


A 

Moreover,  A^  satisfies  the  following  affine-equivariance  property: 


D'A^yADyD  = CoAyAy,  (2.3) 

^ ^ 

where  D is  a fixed  nonsingular  p x p matrix.  Any  is  the  A calculated  on  the 

transformed  observations  Dy^,  and  Co  is  a positive  scalar  that  may  depend  on  D 
and  the  yj’s.  Equation  (2.2)  shows  that  the  unit  vectors  of  the  transformed  data 
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12 


have  a variance-covariance  structure  like  that  of  a random  variable  that  is  uniform 
on  the  p-unit  sphere.  Tyler  (1987)  not  only  showed  that  is  unique  if  the  sample 
is  drawn  from  a continuous  p-dimensional  distribution  and  n > p{p  — 1)  but 

A 

also  proved  that  is  consistent  and  asymptotically  normal.  In  Appendix  A,  an 
iterative  scheme  for  computing  is  described,  it  was  given  by  Tyler  and  it  proved 
to  compute  fast  for  data  in  any  practical  dimension  using  OX,  the  object  oriented 
language.  With  this  choice  of  A,  (2.1)  becomes 


Sn  = npV  V 


(2.4) 


and  Ho  is  rejected  for  large  values.  The  test  statistic  Sn  is  affine-invariant.  Under 
Ho,  Sn  is  distribution-free  for  the  class  of  distributions  with  elliptical  directions 
and  conditionally  distribution-free  for  the  broader  class  of  directionally  symmet- 
ric distributions.  Randles  (2000)  proved  that  if  the  underlying  distribution  is 
directionally  symmetric  around  0,  then  xl- 

We  now  propose  a class  of  signed-rank  statistics  related  to  (2.1)  and  (2.4). 
These  statistics  use  the  ranks  of  the  distances  of  the  transformed  observations  from 


the  origin.  Specifically,  let  Qi  be  the  rank  of  ||AdYi||  among  llAdYi 


A,Yn 


and  define 


Q, 


n + 1 


V- 


where  ^{•)  is  a nonnegative,  nondecreasing,  uniformly  bounded  continuous  function 


that  may  depend  on  the  dimension  p.  The  test  statistic  is  then  the  quadratic  form 


= nV;(n-i  V3*V;,)-iV,.  (2.5) 

i=l 

and  Ho  is  rejected  for  large  values  of  Wnp-  The  test  statistic  W„p  is  affine-invariant 
as  long  as  n > p{p  — 1)  (see  next  section).  Moreover,  if  under  Ho  the  Yj’s  are  iid 

d 

from  a symmetric  density  (i.e.  Yf  = -Yj),  then  Wnp  is  conditionally  distribution- 
free  (conditioning  on  the  observed  ±yj’s).  Using  the  fact  that  Aj,  and  thus  the 
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QiS,  are  invariant  under  changes  in  the  signs  among  the  yi’s,  the  conditional  null 
distribution  of  Wnp  is  easy  to  generate.  If  we  let  5i  = ±1  for  i = 1,  • • • , n,  then 
these  2"  equally  likely  choices  for  the  (5-vector  yield  2”  possible  conditional  values  of 
the  test  statistic  via: 


^ 


n 


n 


■E 


4’ 


i=l  ■- 


Qi 

n -I- 1 


1 


-1 


ViV. 


5iV 


I 


where  qi  and  Vj  are  the  observed  values  of  Qi  and  Vi,  respectively  i — 1,  • • • , n. 
Finally,  Ho  is  rejected  at  level  a if  the  observed  Wnp  value  is  among  the  largest 
100  • a percent  of  the  2”  equally  likely  values  of 


2.2  The  Choice  of  the  Function 

The  (p{-)  function  plays  an  important  role  in  determining  the  Pitman  Asymp- 
totic Relative  Efficiency  (ARE)  of  Wnp-  If  4>{u)  = 1,  for  all  u,  then  Wnp  reduces 
to  Randles  (2000)  sign  test.  In  such  a case,  and  for  elliptically  symmetric  distri- 
butions, Wnp  will  have  the  same  ARE  as  the  tests  proposed  by  Randles  (1989) 
and  Hettmansperger  et  al.  (1994).  Recall  that  these  tests  have  higher  efficiencies 
than  Hotelling’s  for  heavy-tailed  distributions  but  do  not  perform  as  well  for 
light-tailed  distributions.  Now  if  (j>{u)  = u,  for  all  u,  then,  for  elliptically  sym- 
metric distributions,  Wnp  will  have  the  same  ARE  as  Peters  and  Randles  (1990) 
signed-rank  test  which  performs  better  than  Hotelling’s  for  both  light-tailed  and 
heavy-tailed  distributions  and  virtually  as  well  a.s  Hotteling’s  under  the  multi- 
variate normal  distribution  when  p = 2 or  3.  However,  as  the  dimension  increases, 
not  only  does  the  test  of  Peters  and  Randles  become  computationally  difficult  but 
also  its  performance  deteriorates,  especially  in  the  light-tailed  cases  and  the  normal 
case.  In  fact  the  Pitman  efficiency  of  this  test  relative  to  Hotelling’s  goes  to  .75 
as  p 00  for  any  distribution  in  the  power  family  class. 
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In  this  dissertation,  the  function 

0(u)  = Am  + (1- A)l,  (2.6) 

is  recommended.  Here,  A = {ln(p  + e — 1)}“^  depends  on  p,  the  dimension  of 
the  data.  When  using  this  function,  the  test  statistic  will  be  denoted  by 
Winp.  This  choice  is  motivated  first  by  the  simplicity  of  a linear  function  and 
second  by  the  fact  that  when  p = 1,  <f){u)  = u and  the  test  is  the  two-tailed 
Wilcoxon  signed  rank  test.  But  the  most  important  motivation  for  this  choice 
is  the  excellent  asymptotic  relative  efficiencies  of  Winp  established  in  Chapter  3. 
When  the  underlying  distribution  is  the  multivariate  normal,  for  example,  the  tests 
by  Peters  and  Randles  and  by  Hossjer  and  Croux  have  ARE’s,  when  compared  to 
Hotelling’s  T^,  that  approach  .75  as  p — > oo,  while  the  ARE  of  Winp  approaches 
1 as  p oo.  Moreover,  the  ARE’s  of  Winp  are  strong  across  a wide  spectrum  of 
distributions  from  very  light-tailed  to  very  heavy-tailed  population  models. 

2.3  Affine-Invariance  of  Wnt, 

In  this  section  the  affine-invariance  property  of  Wnp  is  established.  This  is 
done  by  proving  the  following  theorem. 

Theorem  2.3.1.  7/D  is  apx  p nonsingular  matrix,  then 

Wnp{Y,,  ■■■,Yn)  = W„p(DYi,  • . • , DY„), 

where  W„p(Yi,  • • • , Y„)  denotes  the  proposed  statistic,  Wnp,  applied  to  the  observa- 
tions Yi,  • • • , Y„. 

Proof.  Following  the  notation  in  Randles  (2000),  let  Ayd  and  Axd  denote 
computed  on  the  y^’s  and  Xj’s  respectively,  where  Xj-  = Dy^  for  i = 1,  • • • , n.  By  the 
affine-equivariance  property  (2.3)  of  A^, 

D AjjjjA-xdD  = CgAydAyd, 
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where  Co  is  a positive  scalar.  Therefore,  for  i = 1,  • • • , n we  have 


= (y-D'A;,^AxdDyf) 

= cl  ^yjA^^Aydyi) " 


= c| 


^ydVi 


The  above  equality  along  with  the  fact  that  Cq  is  positive  implies  that  the  rank 


of 


AydYl 


among 


is  equal  to  the  rank 


•^ydYi 


among 


A 

Aydyn  ,i  = 1, • • • ,n.  Moreover,  Randles  (2000)  proved  that 


V-  = 

^ zx  — 


AxdX| 


= 0 


•^ydYi 


=z  ©V 

w V ly, 


_i  - 


where  0 ==  c©  ^ AxdDAyJ  is  an  orthogonal  p x p matrix.  Thus,  it  is  true  that 


V six  — 4^ 


Q 


n + l 


' Vfx  = &(t>  ( 1 Vi.  = 0V.i 


n + l 


»y 


8%y. 


Therefore, 


Wnp(DYi,---,DYn)  = Wnp(Xi,---  ,Xn) 


n 


= v.(,vv,)->  v„ 


Z=1 


n 


= nv;,e'(n-‘  5]  0v,i,v;,,0')-‘ev.y 


i=l 


n 


= nv;y0'0'-'(n-‘  V V,i,,v;,,)-'0-‘0V„ 


2=1 


n 


2=1 


= VTnp(Yi,-.-  ,Yn). 


This  completes  the  proof. 


□ 
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2.4  The  Asymptotic  Null  Distribution  of  Wno 
In  this  section,  the  asymptotic  null  distribution  of  Wnp  under  the  class  of 
symmetric  distributions,  is  investigated.  To  do  so,  some  notation  is  introduced.  For 
i = 1,  - • • ,n,  let 


= (!>  {G{Ri))  \io 


where  Ri  = Y^A'^AoVi,  G{t)  = PnoiRi  < t),  {t  > 0),  and  A„  is  the  unique  solution 
among  upper  triangular  matrices  with  1 in  the  upper-left  element  satisfying 


In  section  3 of  his  paper,  Tyler  (1987)  showed  that 


(Ad  - A„)  = Op(l).  (2.8) 

Now,  consider  V*  = - y^^_i  Ysi  and  its  approximating  statistic  V^o  = ^ JZLi  ^sio- 
In  order  to  establish  the  asymptotic  null  distribution,  the  following  lemma  is 
needed. 

Lemma  2.4.1.  Under  Ho,  if  Yi,  - - • , Y„  are  iid  from  a distribution  that  is 
symmetric  around  0,  then 

(V,  - Yso)  = Op(l).  (2.9) 


and 


(2.10) 


Proof.  We  start  the  proof  by  showing  that  under  Ho  and  for  all  i, 


G„  (YjAiAjYi)  A a (y;a;a„y,)  , 


(2.11) 
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where  Gn(t)  = ^ EILi ^ (llAdY^lp  <t),so  Gn  (Y^A^^A.Yi)  = Q./n.  Applying 
lemma  2 of  Peters  and  Randles  (1990)  to  ( 2.11),  it  is  seen  that  we  need  to  prove 


the  two  conditions: 


(A'.Ad  - a;a„)  = Op(i) 

sup|Y;(A;A„  + n-'/2s)Yi-Y;A;A„Yi|  = 0^(1), 

sE^ 


(2.12) 

(2.13) 


where  is  the  set  of  all  p x p matrices  in  a bounded  sphere  centered  at  Opxp-  Tyler 
(1987)  essentially  proved  (2.12)  (see  theorem  3.2  on  page  243).  To  verify  (2.13),  let 
M > 0 be  the  bound  on  each  element  of  the  matrices  in  /3,  then  for  s € ^9  we  have 


Y'iiKAo  + - y;a;a„y, 


< 


Y;n-^/2gy.| 

t=l  t'=l 
t=i  t'=i 


where  s«/  is  the  (t,  t'Y^  element  of  s.  Therefore 


sup  |Y(.(A;Ao  + n-'/2s)Yi  - Y;A' AoYj  < 

sG/9 

Thus  (2.13)  is  established  and  hence  (2.11)  also  follows. 
Now  to  prove  (2.9),  write  -y/n  (V*  — Vgo)  as 


p p 


YitYie  4 0 


t=i  t'=i 


y/n 


v^^{i|:^(;;fl)iv.-vd 


n 


1 " 

-E 

n “ 


i=l  >- 


<l> 


Qi 


n + 1 


<f>iG{Ri)) 


V* 
▼ 1 


to 


(2.14) 


(2.15) 
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The  element  of  the  vector  in  (2.14)  is 


Cfe 


Vn  ^ 

1=1 


Q, 


a'feYi 


aLYi 


n + lj  LllAdYfll  ||A„Yi 


where  ajj.  and  are  the  rows  of  A^  and  A„  respectively.  Let  /(A)  = n^y , 
then  the  derivative  of  the  univariate  function  /(•)  with  respect  to  the  row  of  A 


IS 


B,(i,A,Y) 


«Y)(a;Y)^ 
'lAYII®  * 


AY 


Y KY)"Y 

ayP" 


if  ^ j, 

iik  = j. 


(2.16) 


A Taylor  series  expansion  of  c*  yields 


E 

j=i 


y/n(aj  - ^jo)'- ^2  ^ Bfc(j,  Aik,  Yj) 

i=l  ^ ' 


where  Aik  is  between  Aj  and  Ag.  Conditioning  on  all  Yj,j  ^ i. 


Q 


Q 


Q, 


^ Y*)  ^ j Bfc(i,  Aik,  Yi)cf>  ( ) Bfe(j,  A,k,  Y,) 

are  odd  axial  step  functions  in  Y*  and  thus,  by  lemma  (A.2)  of  Randles  (2000), 
their  conditional,  and  thus  unconditional,  expectation  is  zero.  Hence,  by  Markov’s 
theorem  J 0 (^)  Bfc(j,  Ai*,,  Yi)  A 0 (see  Gnedenko,  1962,  p.  232).  More- 
over, -\/n{a.j  — 3ijo)  is  bounded  in  probability;  hence. 


Cfe  Ao. 


This  proves  that  (2.14)  goes  in  probability  to  zero.  Next,  we  show  that  (2.15)  also 
goes  in  probability  to  zero.  To  do  so,  consider  the  k^^  element  of  that  vector,  say. 


</> 


<5, 


n -|- 1 


'Ha(Ri)) 


A Y 

*^0  ^ t 


and  let 


4> 


Q, 


n -f- 1 


-<f>{G{Ri)) 


<k^i 

\AoYi 


Tik  - 
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Then, 


E[ll]  = i 


n 


'^Tik 
2=1 


-E 

n 


-E 

n 


n 


L 2=1  i<j 


n 


E 

2=1 


ik 


+ ^E 

n 


li<j 


= E{rJk)  + \ (^2) 


The  last  equality  follows  from  the  fact  that  the  terms  in  each  sum  are  exchange- 
able. Conditioning  on  Yj,  j ^ i,  rikVjk  is  an  odd  axial  function  in  Y,-,  which  implies 


that 


ElrikVjk]  = 0. 


Now  consider 


B[r?J  = E 


<!> 


Q, 


n -f  1 


-<t>{G{Ri)) 


<k^i 

AoYA 


Note  that,  i^y*,,  = Op(l)  and  [(j)  (;^)  — (j>{G{Ri))Y  ^ 0 follows  from  (2.11)  and 


2 p 


the  continuity  of  (/>(•).  Therefore,  rfi^  0.  Moreover,  rik  is  uniformly  bounded  and 
thus,  by  the  Lebesgue  Dominated  Convergence  Theorem,  E[r^^]  0.  This  implies 

that  E[ll]  — > 0.  Applying  Markov’s  inequality  (see  Billingsley,  1995,  p.80)  yields 
Ik  — >•  0.  This  completes  the  proof  of  (2.9). 
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To  prove  (2.10)  we  proceed  as  follows.  The  left  side  of  (2.10)  can  be  written  as 


i=l  i=l 

n 


n ^ 


Q\ 


i=l 
n 


n + 1 


; E ■A"  (G(«<))  V(„v;.. 

"m 


= iE^^(s^)[vv:-v.vLi 

1=1  ^ ' 

1 " 

+-E 

n 


2=1  •- 


n + 1/ 

02/  ' ^2 


n + 1 


- r (G(Ri)) 


V- 

^ 20  ^ io* 


(2.17) 


(2.18) 


Now,  write  (2.17)  as 


1 " 
-E'^ 

n i ^ 


Qt 


vn  + 1 

2=1 


[(Vi  - Vi,)(Vi  - YJ  + Vi„(Vi  - Yio)'  + (Vi  - Vi„)vy 


Using  a Taylor  series  expansion,  the  (s,  t)**  element  of  this  matrix  is 


1 " 

-E^ 

n ^ 
2=1 


n + 1 


p 


E(4i-ai«)'B,(j,Ai,Yi) 

■i=i 


+-E^'( 

"tr  V 

1 ” 

+iE^' 

71  it— ^ 


<5, 


n + 1 


lOS 


J](a,.-a,..„)'B,(/,A,,Y,) 

lj'=l 

^(a,  - a„)'B,(j.  A,,  Y,) 

j=l 


Q^ 


\n  + 1 

2=1 


P 


E(&j-ai.)'B,(;,Ai,Y,) 

Li=i 


lot 


1 ” 

= -E<i> 


Q> 


n “ \n  + 1 
2=1 


P 3 


Lj=l  k=l 

P f 


(^jfc  ^jko)  ^sk  Aij  ^2^ 


2=1  ^ 
n 

+-E^^ 

71  < ^ 


Q. 


ijf=i  fc'=i 

p j 


i^j'k'  ^j'k'o)  Bfk'  5 Aj,  ^2^ 


n + 1 


V" 

r t 
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n ^ vn  + 1 
2=1 


Lj=l  k=l 
P 3 


^jko)  ^tk  A^*,  ^2^ 


lj=l  k=l 


i^jk  ^jko)  ^sk  Ai,  ^2^ 


K: 


20t 
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p j p j' 

— ^ ^ ^ ^ ^ ^ ~ ^jko)  (%'fe'  ~ ^j'k'o) 

j=l  k=l  j'=l  k'=l 


j,  Aj,  Yj^  Btk'  (j',  Aj,  Yi 


n + 1 


^iosBfk 


(j,  Ai,Y, 


where  B«  (j,A,Yi)  is  as  defined  in  (2.16).  The  fact  that  (j,A,Yi)  is  uniformly 
bounded  for  A in  some  neighborhood  of  Ao,  and  (A  — A<,)  = Op(l)  imply  that 
the  above  summations  all  go  in  probability  to  zero.  Now  consider  (2.18),  an  (s,t) 
element  of  that  matrix  is 


1 " 
n 


i=l  >- 


<5, 


n + 1 


4?  (G(R,)) 


^ios  ^iot‘ 


(2.19) 


The  scalars  Vios  and  Viot  are  uniformly  bounded  and  [0^  (;^)  ~ (^(^))]  — 

Op(l),  thus  the  elements  of  the  above  summation  are  Op(l).  Note  also  that  the 
terms  in  (2.19)  are  exchangeable  and  uniformly  bounded.  Thus  by  the  Lebesgue 
dominated  convergence  theorem  (see  Serfling,  1980,  p.  11) 


lim  E 

n—>oo 


<f‘ 


Q\ 


n + 1 


- 4?  {G{Ri))  ViosViot 


= 0. 


Moreover, 


Var{wat)  = E 


Q. 


t=i 
n 


n + 1 


- cj?  {G{Ri))  ViosViot 


0‘ 


Qj 


n + 1 
^2/ 


-4>^{G{R,))  V^sVi 


2 t/2 
iot 


n + 


- -4>^{G{Ri)) 


n + 


j)-r  {G{Rj))  J ViosViotVjosVjot 


(2.20) 


(2.21) 
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Clearly  (2.20)  is  Op(l),  since  the  terms  are  exchangeable  and  the  expected  value  is 
bounded.  The  terms  in  (2.21)  are  uniformly  bounded  and  Op(l),  thus, 


lim  E 

n— >oo 


</>' 


Q\ 


n + 1 


4?  (G(m  4, 


Q: 


n + 1 


<^2  {G{Rj))  ViosViotVjosVjot 


= 0. 


An  application  of  Markov’s  theorem  implies  that  each  element  of  the  matrix  (2.18) 


is  Op(l),  hence  (2.18)  is  Op(l). 


□ 


With  lemma  (2.4.1)  established,  we  are  ready  to  find  the  limiting  distribution 
of  Wnp.  This  is  done  by  proving  the  following  theorem: 

Theorem  2.4.1.  // Yi,  • • • , Y„  are  iid  from  a distribution  that  is  symmetric 
around  0 then,  under  Ho,  nWnp  converges  in  distribution  to  a chi-square  distribution 
with  p degrees  of  freedom. 

Proof.  First,  the  limiting  distribution  of  the  approximate  statistic  ^/nVso  is 
established.  Consider  V^o  = ^ ^"=1  </»  ((j(i?j))  Vjo  and  write  Yj  = SiY^.  Here, 

5i  = sign(yii),  where  Yu  is  the  first  component  of  the  vector  Yj,  sign(t)  = 1,  0, 

— 1 as  t > 0,  =,  t < 0,  respectively,  and  Y,*  = 5iYj.  Under  Ho,  5\,--  • ,6n  are  iid 
Bernoulli  random  variables  with  p{Si  = 1)  = p{Si  = —1)  = \.  Moreover,  (5i,  • • • ,S. 


n 


are  independent  of  YJ, • • • , Y*  and  thus  of  Ri  = ||AoYi||, •••  ,R„  = ||AoY„||.  Now 
consider  the  elements  of  the  summation  in  Ygo,  the  conditional  expectation 


E 


4>{G(Ri)) 


A Y 
A Y 


= E 


= E 


(l>{G{Ri))5i 


AoYt 


* 

I 


Ky; 

AoY; 


AoY; 


|y; 


-E 


<l>iG{Ri)) 


ApYt 

AoY* 


0. 


23 


This  implies  that  E {G{Ri))  = 0.  Using  a similar  argument,  we  can 


prove  that 


E 


<!>{G{Ik))4>{G{Rj)) 


A„Yi  / AoY, 


M 


A„Yi  V||A„Y 


= 0 


To  finish  the  proof,  let  fl  = Cov  \^(f>{G{Ri))  ^hen,  by  the  Multivariate 

Central  Limit  Theorem, 

v^V,,~iVp(0,n).  (2.22) 

This  along  with  (2.9)  imply  that  y/nVg  ~ Np{0,ft).  Therefore,  nV'(n)“^V4  has 
a limiting  xj,  null  distribution.  Finally,  a simple  application  of  Slutsky’s  theorem 
implies  that  Wnp  has  a limiting  xl  null  distribution.  This  is  due  to  the  fact  that 
Z"=i  ^sioV'sio  - = Op(l)  and  result  (2.10).  □ 


CHAPTER  3 
TEST  COMPARISONS 


3.1  Asymptotic  Distribution  of  Wnv  Under  Contiguous  Alternatives 
In  order  to  compute  Pitman  asymptotic  relative  efficiencies  of  Wnp,  the 
asymptotic  distribution  of  Wnp  under  contiguous  alternatives  is  obtained.  Thus, 
a sequence  of  alternatives  approaching  the  null  hypothesis  is  considered.  In 
particular,  consider 

Hn-.0  = 9n  = -^,  (3.1) 

y/n 

where  c = (ci,  •••  ,Cp)  is  a constant,  yet  arbitrary,  vector.  In  this  chapter, 
the  asymptotic  results  are  developed  for  the  case  when  the  Yj’s  come  from  the 
class  of  elliptically  symmetric  distributions.  Thus,  each  Y{  can  be  expressed  as 
Yj  = RjDU,-  + 0 where  D is  a fixed  nonsingular  p x p matrix,  the  Uj’s  are  iid 
uniformly  distributed  on  the  unit  p-sphere,  and  the  i?j’s  are  iid  positive  random 
variables  independent  of  the  Ui’s  (see  Fang,  Kotz,  and  Ng,  1990,  book).  Moreover, 
if  Y is  elliptically  symmetric,  then  the  density  function  of  Y is  of  the  form 


My)  = ^fp|S|-5ft{(y  - 9)'S-'(y  - 9)},  (3.2) 


where  Kp  > 0,  and  S is  proportional  to  the  covariance  matrix  of  Y,  provided  it 
exits.  Without  loss  of  generality,  take  D = I,  the  p x p identity  matrix  which 
implies  that  S = I.  This  is  valid  because  Wnp  is  affine-invariant.  Two  specific 
families  of  elliptically  symmetric  distributions  are  considered.  The  first  is  called  the 
power  family  which  has  the  form  in  (3.2)  with 


h{t)  = exp  < — 


t 

Co 


u > 0, 
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where 


— 


pT{pl2v) 
V{{p  + 2)l2vy 


uT{pl2) 

T{pI2u)[kcYI^' 


and 


POO 

T{w)  = / 

Jo 


w > 0. 


This  family  includes  the  multivariate  normal  density  {u  = 1),  as  well  as  heavier- 
tailed  distributions  (0  < < 1)  and  lighter-tailed  distributions  {u  > 1).  The  second 

is  the  multivariate  t-distribution  where  h{-)  and  Kp  in  (3.2)  are  defined  as 


h{t) 


ip+v)/2 


and 


r((p+i/)/2) 

''  r(i//2)(iri/)i>/2 


(3.3) 


Before  the  asymptotic  distribution  of  Wnp  is  obtained,  a few  comments  are 
essential.  First,  note  that  for  elliptically  symmetric  distributions  and  under  Ho, 
Ao,  the  matrix  defined  in  section  2.3  is  such  that  A„  = I.  This  is  easily  seen  from 
condition  (2.7)  and  the  uniqueness  of  Aq.  Second,  the  variance-covariance  matrix 


ft  defined  in  (2.22)  is  such  that 


ft  = kL 


pxp? 


(3.4) 


where, 

k = -E„,  [<^"(G(R,.))] . 

P 

The  next  theorem  describes  the  distribution  of  Wnp  under  (3.1)  for  the  general 
elliptically  symmetric  distribution. 

Theorem  3.1.1.  7/ Yi,  • • • , Y„  are  iid  p-variate  vectors  from  an  elliptically 
symmetric  distribution  whose  density  is  given  in  (3.2),  then  under  the  alternatives 
defined  in  (3.1),  Wnp  has  a limiting  non-central  chi-square  distribution  withp 


degrees  of  freedom  and  non- centrality  parameter 


- (r) 


T7»2 
4.  ^Hc 


v’  ElW(G(R))] 


where 


h'{R^)  = 


dh{t) 

dt 


\t=R^ 


Proof  Without  loss  of  generality,  consider  the  case  S = I.  Under  Ho,  the 
distribution  of  the  Y’s  has  the  form 


/(y)  = /o,i(y)  = Kph{y'y). 


Hence, 


dtf{y)  = Kp2yth'{y'y) 


and 


dtf{y)  ^ 


/(y) 


h(y'y) 


Following  the  rationale  in  Chapter  7 of  Hajek,  Sidak  and  Sen  (1999),  let 


_J_  ^ / r^/(y) 

Vn  ^ L /(y)  . 


n 


2c' 

y/n  f-' 
^ 1=1 


h'iy'^yi) 

. h{y'iyi) . 


where  c is  as  defined  in  (3.1),  and 


Lr,  = V^(xi-f]  cj>{G{Ri))Ua,  • • • , Api  V <t>{G{Ri))Uip 

V 


• • • 
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Consider 


Dn  — “1“  Cl2Tn 

= y/naiXi{—  (/>{G{Ri))Uii)  + h y/naiXp{—  (l>{G{Ri))Uip) 

i=l 

2o2  ^ , /i'(y-yi) 


n 


t=l 


+ ^ XI  cVi 


HViYi) 


- n 1 ^ 

= y/naiXi{—  0(G^(-Ri))f7ii)  + h \/naiAp(—  (f>{G{Ri))Uip) 

Ti.  . 

z=l  Z=1 


n 


+ ^ V c'B,U, 


fc(R?) 


^ 1=1 


(^2a2CpRij^^^  + aiXp(j){G{Ri))^  Uip 


Each  term  on  the  right  has  zero  expectation  because  the  Ri  and  Uu  are  indepen- 
dent and  En^lUit]  = 0,  for  all  1 < i < n and  all  1 < t < p.  Moreover,  the  terms  in 
the  bracket  are  uncorrelated  because  EHolUitUw]  = 0,  for  all  t 7^  t'.  Now  consider 


crtt 


= E 


He 


2a2CtRi^^^  + aiXi<j>{G{Ri))^  Uf, 


= E 


Ho  \Pi^  I (2a2C()^  Ehc 


Ri 


h'(Rf) 

h(Rh  J 


-I-  2 {2a2Ct){a\Xt)EHo 


<f>{G{Ri))Ri 


h'jRl) 

h{Rl)  J 


+ 


— -{  i‘^0,2Ct)  Eh^ 


Ri 


mh 

h(RI)  J 


-t-  2 (2a2Ct)(aiAt)£'/f^ 


<f>iG{Ri))Ri 


h'jRl) 
h{Rl)  J 


+ 


Therefore,  by  the  multivariate  central  limit  theorem 


Dn  -4  iV(0,  <Tii  -I + (Tpp), 
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where 


1 


(Til  + • • • + (Tpp  — -(202)  E[j^ 


P 


Ri 


ml) 

h(R?)  j 


t2  P 


E 


t=l 


+ 

P 


(f>{G{Ri))Ri 


h'jRl) 

HRl)  J 


'^CtXt 


t=l 


t=l 


Note  that  L„  = y/nX'Ygoi  where  A = (Ai,  • • • , \p).  Thus,  using  (2.22)  and  (3.4),  it 


is  clear  that 


L„  4 JV(0,  -Eg,  [^"(G(iii))]  -'?)■ 

^ t=l 


Moreover,  T„  can  be  written  as  T„  = ;^  XliLi 


D h'(Rj) 


c'Uj,  thus 


(3.6) 


Eho  [Tn]  = 0, 


and. 


Var  (T„)  = a^  = -Eh„ 

P 


Ri 


mi) 

h{Si)  J 


n 2 P 


t=i 


Moreover,  Ic{f)  = ctt,  where  Idf)  represents  the  Fisher  information  of  /(•). 
Therefore  it  follows  from  Le  Cam’s  third  lemma  that,  under  the  alternatives  in 


(3.1), 


i„  4 AT  ( -Eh, 

P 


h'jRl) 

J 


c'A,  [.^"(G(iSi))]  A'A  ) , 
P 


and  thus 


^/n'V  sQ 


N I -Eff^ 
P 


(j>{G{Ri))Ri 


h'jltl) 
hjRl)  J 


c,  -Eh,  [.^"(G(B,))1  I 


Therefore,  under  the  contiguous  alternatives  in  (3.1), 


w^p-^xl.  I (r)^" 


R,l00i^(G(R,)) 


p'  E%WjG{Ri))] 


c c 


(3.7) 


□ 


Corollary  3.1.1.  // Yi,  • • • , Y„  are  iid  from  the  power  family  of  distributions, 
then  under  the  alternatives  defined  in  (3.1),  Wnp  has  a non-central  chi-square 
distribution  with  p degrees  of  freedom  and  non- centrality  parameter 
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2^/  VEj,U{G{R))R 


3iw  - 


2i/-l 


E„,ma(R))] 


Ic'c 


(3.8) 


Proof.  The  proof  follows  from  theorem  3.1.1  and  the  fact  that  for  the  power  family 

h{t)  = exp{-(^)‘^}, 


and 


h'{t) 

h{t) 


^exp{-(^r> 


.1/ 


□ 


Corollary  3.1.2.  //Yi,  • • • , Y„  are  iid  from  the  multivariate  t- distribution  family, 
then  under  the  alternatives  defined  in  (3.1),  Wnp  has  a non-central  chi-square 
distribution  with  p degrees  of  freedom  and  non- centrality  parameter 


hw  — 


V v^i' 


Eh.  [^^(G(/i))] 


c c 


Proof.  The  multivariate  t-distribution  is  such  that 


h{t)  = 


V 


(3.9) 


and 


m ^ (-^K^)  [i+a 

hit) 


ML 

[1  + ^]' 


[i+J] 


E±il 


Thus  the  proof  follows  readily. 


□ 
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3.2  ARE  of  Wnr, 

The  Pitman  asymptotic  relative  efficiency  of  two  test  statistics  S and  T,  say 
ARE(S',  T),  is  defined  in  the  following  fashion.  Let  Hn  : 0 = On  be  a sequence 
of  alternatives  approaching  the  null  hypothesis,  which,  without  loss  of  generality, 
can  be  taken  as  Ho  : 0 = 0,  and  consider  the  two  test  sequences  {5m„}  and  {T,n*} 


associated  with  this  set  of  alternatives.  Here,  m„  and  m*  are  the  sample  sizes 
required  for  the  two  sequences  of  tests  to  satisfy 


0 < q:  < lim 

n—^oo 


= lim  Pt^.  < 1 

n-^oo 


where  a is  the  the  common  limiting  significance  level  of  the  two  test  sequences  and 
Psm  ) Pt  . are  the  powers  of  (On)  and  Tm-  (On)  (i-6.  the  probabilities  that  the 
tests  Sm„  and  T^.  reject  Hg  when  0 = 0„.).  Then  the  asymptotic  relative  efficiency 
of  S relative  to  T is 

777* 

Ai?E(5,T)  = lim (3.10) 

n->oo 

provided  the  limit  exists  and  is  the  same  for  all  {0„},  {m„}  and  {m*}.  Note  that, 
if  the  test  sequences  and  } are  asymptotically  non-central  chi-square 


with  p degrees  of  freedom  and  non-centrality  parameters.  As  and  Ay,  respectively, 
then 


ARE(S,  T) 


(3.11) 


(see  Hannan,  1956). 

Before,  the  ARE’s  of  Wnp  with  respect  to  Hotelling’s  are  obtained,  the  non- 
null asymptotic  distributions  of  Hotelling’s  for  the  two  families  of  distributions 
are  given  in  the  following,  well-known,  theorems  (see  Mdttonen  et  al.,  1998). 
Theorem  3.2.1.  If  Yi,  • • • , Y„  are  iid  from  the  power  family  of  distributions,  with 
S = I,  then  under  the  alternatives  defined  in  (3.1),  Hotelling’s  has  a non-central 
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chi-square  distribution  with  p degrees  of  freedom  and  non- centrality  parameter 


Sij.  = c'c. 


(3.12) 


Theorem  3.2.2.  If  Yi,  - • • , Y„  are  iid  from  the  multivariate  t- distribution  family, 
with  S = I,  then  under  the  alternatives  defined  in  (3.1),  Hotelling’s  has  a 
non-central  chi-square  distribution  withp  degrees  of  freedom  and  non- centrality 


parameter 


^2t  = ( - — - ) c'c, 


(3.13) 


where  u is  the  degrees  of  freedom  of  the  t- distribution. 

Now,  the  ARE’s  of  Wnp  with  respect  to  Hotelling’s  are  given  in  the 
following  theorems. 

Theorem  3.2.3.  If  Yi,  - - - , Y„  are  iid  from  the  power  family  of  distributions, 
then,  for  testing  Ho  : 0 = 0 against  the  alternative  in  (3.1), 


ARE(W„„  T*)  = 


Eh.  l^=(G(ii))l 


pi-l^cv 


where 


(3.14) 


Eh.  = 


2A'p7T'’/2 

r())/2) 


[/ 


00 


'Wm 


r,P-e-(£)" 

Jo 


2\t' 


dt  1 dr 


and 


EHA<f>\G(R))]  = [\j>\u)du. 

Jo 


, (3.15) 


(3.16) 


Proof.  Without  loss  of  generality,  take  S = I.  Result  ( 3.14)  follows  immediately 
from  corollary  3.1.1,  theorem  3.2.1,  and  result  (3.11).  To  prove  (3.15)  and  (3.16), 
recall  that  G(-)  represents  that  cumulative  distribution  function  of  the  Ri’s.  Thus, 
for  the  power  family  of  distributions,  t = B?  has  the  density 


i(t)  = 


V 


r(p/2) 


.1/ 

'O 


(3.17) 
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where  Kp  and  Cg  are  defined  in  (3.3).  Let  R = y/i  then  dR  = and  the 


density  of  R is 


r(p/2) 


^ exp 


.2« 


*V 

'O 


(3.18) 


Thus,  (3.15)  follows  readily  from  (3.18).  To  complete  the  proof,  write 


poo 

Eh.  [(^2(G(/?))]  = / (f>‘^{G{r))g{r)dr. 

Jo 


Let  u = G{r),  then  du  = g{r)dr  and  (3.16)  is  easily  established. 


□ 


Theorem  3.2.4.  If  Yi,-  • • , Y„  are  iid  from  the  t- distribution  family  of  distribu- 


tions,then,  for  testing  Hg  : 0 = 0 against  the  alternative  in  (3.1), 


ARE{Wnp,T^)  = 


V 


p + uVEl^[m{R))RI{-^  + lR^)] 


v-2 


y/p^ 


Eh.  mG{R))] 


(3.19) 


where 


E 


H. 


4>(G(R))RK\  + -R^) 


2r(E±i:) 


r(p/2)r(i//2)«/i>/2 


poo 

Jo 


4> 


2T(’^) 


r(p/2)r(../2)i^p/2 


/ 


-1 


J [1+?1 


dt 


.p-i 


dr 


(3.20) 


and  Eh.  [0^(G(i?))]  is  as  defined  in  (3.16). 


Proof.  Without  loss  of  generality,  take  S = I.  Result  (3.19)  follows  immediately 
from  corollary  3.1.2,  theorem  3.2.2,  and  result  (3.11).  To  prove  (3.20),  let  ^ 
then  t is  distributed  as  an  F-distribution  with  p numerator  degrees  of  freedom  and 
u denominator  degrees  of  freedom  (see  Mdttonen  et  al.,  1998,  p.l25).  Thus,  the 
density  function  of  t is 

. , r(2±i^)  , , ,2 

“ T{p/2)r(v/2)  ^1''^  (l  + (p/^)t]“f  ■ 
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Taking  r = then  dr  = \{pt)  and  R has  a density  of  the  form: 

2r  (^)  \ rP-i 

r{p/2)r{u/2)up/^ ) [1  + ^]^' 

Note  that  (3.20)  follows  readily  from  (3.21).  This  completes  the  proof.  □ 

Table  3.1  displays  the  Pitman  asymptotic  relatives  efficiencies  of  the  new 
signed-rank  statistic,  Winp,  Randles  (2000)  sign  test,  Sn,  the  signed-rank  statistic 
due  to  Peters  and  Randles  (1990),  PRn,  and  the  signed  sum  statistic  due  to  Jan 
and  Randles  (1994),  JRn,  relative  to  Hotelling’s  T^,  for  the  case  when  the  Y’s 
come  from  the  power  family  of  distributions.  Table  3.2  contains  the  efficiencies  of 
^inp,  Sn^  PRni  and  the  signed-rank  test  of  Hettmansperger  et  al.  (1994),  HNOn, 
relative  to  Hotelling’s  T^,  for  the  case  when  the  Y’s  come  from  the  family  of 
multivariate  t-distributions.  Note  also  that  the  interdirection  sign  test  of  Randles 
(1989)  has  the  same  efficiencies  as  Sn  and  the  test  by  Hossjer  and  Croux  (1995)  has 
the  same  efficiencies  as  PRn- 

It  is  clear  from  tables  3.1  and  3.2  that  the  new  signed-rank  test  statistic  has 
strong  efficiencies  among  a variety  of  distributions,  ranging  from  very  heavy-tailed 
distribution  to  very  light-tailed  ones.  It  is  also  seen  that  these  efficiencies  do  not 
deteriorate  when  the  dimension  increases  as  is  the  case  for  the  test  proposed  by 
Peters  and  Randles  (1990).  For  normal  alternatives  (Table  3.1  with  i/  = 1 or 
Table  3.2  with  degrees  of  freedom  going  to  oo),  not  only  performs  virtually 
as  well  as  Hotelling’s  but  also  has  higher  efficiencies  than  the  other  tests,  5„, 
PRn,  JRn,  and  HNOn  when  p > 2 (with  the  exception  of  PRn,  when  p = 2). 

For  the  heavy-tailed  distribution  of  the  power  family  (that  is  u < 1),  Winp  has 
higher  efficiencies  than  both  PRn  and  JRn  but  slightly  lower  efficiencies  than  those 
of  Sn,  especially  when  the  dimension  increases.  For  the  light-tailed  distributions 
of  this  family  (that  is  i/  > 1),  the  efficiencies  of  Winp  exceed  those  of  JRn  and 
Sn  but  are  less  than  those  of  PRn-  However,  this  slight  advantage  of  PRn  in  the 


(3.21) 
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Table  3.1;  Asymptotic  Relative  Efficiencies  Under  the  Power  Family  of  Distribu- 
tions 


V 

Dimension 

Test 

.2 

.5 

1 

2 

5 

1 

^^Inp 

11.949 

1.500 

.955 

.873 

.907 

PRn 

11.949 

1.500 

.955 

.873 

.907 

JRn 

11.949 

1.500 

.955 

.873 

.907 

Sn 

127.451 

2.000 

.637 

.411 

.347 

2 

^^Inp 

5.739 

1.332 

.971 

.931 

1.001 

PRn 

3.003 

1.130 

.985 

1.051 

1.221 

JRn 

4.188 

1.296 

.966 

.883 

.888 

Sn 

11.597 

1.500 

.785 

.590 

.519 

3 

^^Inp 

3.547 

1.240 

.978 

.950 

1.025 

pfL. 

1.818 

1.000 

.975 

1.099 

1.346 

JRn 

2.741 

1.211 

.974 

.898 

.874 

Sn 

5.346 

1.333 

.849 

.688 

.620 

4 

^^Inp 

2.688 

1.186 

.982 

.961 

1.034 

PRm 

1.402 

.940 

.961 

1.109 

1.395 

JRn 

2.168 

1.164 

.978 

.912 

.878 

Sn 

3.593 

1.250 

..884 

.749 

.687 

5 

^^Inp 

2.247 

1.151 

.985 

.968 

1.038 

PRn 

1.197 

.900 

.949 

1.106 

1.412 

JRn 

1.902 

1.136 

.981 

.922 

.896 

Sn 

2.816 

1.200 

.905 

.790 

.734 

10 

^^Inp 

1.523 

1.077 

.993 

.987 

1.040 

PRn 

.875 

.825 

.907 

1.061 

1.365 

JRn 

1.388 

1.071 

.990 

.953 

.934 

Sn 

1.702 

1.100 

.951 

.885 

.849 

20 

^^Inp 

1.234 

1.037 

.997 

.997 

1.033 

PRn 

.760 

.788 

.869 

.995 

1.245 

JRn 

1.181 

1.036 

.994 

.974 

.962 

Sn 

1.310 

1.050 

.975 

.940 

.920 

Entries:  Pitman  asymptotic  relative  efficiencies  with  respect  to  Hotelling’s 


Distribution 

PRn 

JRn 

Sn 


Power  family. 

New  procedure  with  the  linear  </>(•)  function. 
Peters  and  Randles  procedure. 

Jan  and  Randles  procedure. 

Randles  (2000)  procedure. 
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Table  3.2:  Asymptotic  Relative  Efficiencies  Under  Multivariate  i-Distribution 


Degrees  of  freedom 

Dimension 

test 

3 

6 

10 

20 

OO 

1 

^^Inp 

1.900 

1.164 

1.054 

.997 

.955 

PRn 

1.900 

1.164 

1.054 

.997 

.955 

HNOn 

1.900 

1.164 

1.054 

.997 

.955 

JRn 

1.900 

1.164 

1.054 

.997 

.955 

Sn 

1.621 

.879 

.757 

.690 

.637 

2 

^^Inp 

1.954 

1.181 

1.068 

1.011 

.971 

PRn 

1.748 

1.123 

1.041 

1.005 

.985 

HNOn 

2.026 

1.196 

1.064 

.992 

.937 

JRn 

1.952 

1.185 

1.069 

1.011 

.966 

Sn 

2.000 

1.084 

.934 

.851 

.785 

4 

^^Inp 

2.055 

1.209 

1.084 

1.023 

.984 

PRn 

1.533 

1.018 

.964 

.952 

.961 

HNOn 

2.173 

1.241 

1.088 

1.004 

.937 

JRn 

2.002 

1.207 

1.088 

1.025 

.978 

Sn 

2.250 

1.220 

1.051 

.958 

.884 

6 

^^Inp 

2.120 

1.228 

1.095 

1.029 

.989 

PRn 

1.423 

.953 

.911 

.911 

.938 

HNOn 

2.256 

1.270 

1.108 

1.018 

.947 

JRn 

2.024 

1.218 

1.095 

1.031 

.984 

Sn 

2.344 

1.271 

1.094 

.997 

.920 

10 

^^Inp 

2.194 

1.251 

1.107 

1.037 

.993 

PRn 

1.315 

.882 

.848 

.857 

.907 

HNOn 

2.346 

1.304 

1.132 

1.037 

.961 

JRn 

2.045 

1.228 

1.103 

1.038 

.990 

Sn 

2.422 

1.313 

1.131 

1.031 

0.951 

Entries:  Pitman  cisymptotic  relative  efficiencies  with  respect  to  Hotelling’s  T^. 
Distribution  = Multivariate  t-distribution. 

= New  procedure  with  the  linear  (/>{•)  function. 

= Peters  and  Randles  procedure. 

= Hettmansperger,  Nyblom  and  Oja  (1994)  procedure. 

= Jan  and  Randles  procedure. 

= Randles  (2000)  procedure. 


^^Inp 

PRn 

HNOn 

JRn 

Sn 
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light-tailed  cases  disappears  slowly  as  the  dimension  increases.  Note  that  as  the 
dimension  goes  to  infinity,  Winp  becomes  Sn,  and  the  efficiencies  of  Winp,  JRn,  and 
Sn  approach  1,  while  those  of  PRn  go  to  .75. 

For  the  multivariate  t-distribution  family,  Winp  has  higher  efficiencies  than 
PRn  for  all  degrees  of  freedom.  This  confirms  the  previous  results  since  the  t- 
distribution  is  heavier-tailed  than  the  normal  distribution.  As  compared  to  HNOn, 
Winp  has  slightly  smaller  efficiencies  when  the  degrees  of  freedom  are  less  than  20 
and  larger  efficiencies  otherwise.  The  sign  test,  Sn,  is  designed  for  heavy-tailed 
cases  and  has  slightly  higher  efficiencies  than  Winp  as  a result.  The  eflSciencies 
of  Winp  are  slightly  larger  than  those  oi  JRn  for  small  degrees  of  freedom  and 
virtually  the  same  for  the  higher  ones.  Here  also,  when  the  the  dimension  increases 
to  infinity,  the  efficiencies  of  all  the  tests,  with  the  exception  of  PRn,  approach  one. 

3.3  Monte  Carlo  Results 

A Monte  Carlo  study  for  dimension  p = 2,  sample  size  n = 30,  and  significance 
level  a = .05  was  conducted  with  the  objective  of  investigating  the  performance 
of  Winp.  Also  included  in  the  study  were  Hotelling’s  T^,  the  quadratic  form,  Sn, 
due  to  Randles  (see  2.4),  the  bivariate  signed  rank  test,  BH,  due  to  Brown  and 
Hettmansperger.  These  four  test  statistics  were  compared  under  elliptical  symmet- 
ric distributions  with  densities  from  the  power  family  and  the  family  of  multivariate 
t-distributions.  The  distributions  were  located  at  0 = (kS,kS),k  = 0, 1,2,3. 

The  value  6 was  adjusted  for  different  distributions  to  examine  somewhat  similar 
points  on  the  power  curves.  All  calculations  and  random  variable  generations  were 
performed  on  a Unix  workstation  using  OX,  an  object  oriented  language. 

To  generate  observations  from  distribution  in  the  power  family,  recall  that 
,when  0 = 0 and  S = I,  has  the  distribution  described  in  (3.17).  Taking  W = 
{R?/coY,  it  can  be  easily  seen  that  W has  Gamma(p/2i/,l)  distribution.  Moreover, 
recall  that  each  observation  can  be  written  as  Y = i?U,  where  R is  independent 
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of  U which  is  uniformly  distributed  on  the  unit-sphere  (see  Johnson  (1987)).  Thus 
to  generate  the  Y’s,  the  uniform  vector  U is  generated  via:  U = Z/||Z||,  where  Z 
has  the  bivariate  standard  normal  distribution,  and  R via  R = where  c„ 

is  defined  in  (3.3).  The  entries  in  the  tables  represent  the  proportion  of  times  out 
of  3000  iterations  in  which  each  test  statistic  exceeded  the  upper  a-percentile  of  its 
asymptotic  null  distribution.  In  Table  3.3,  the  results  from  the  first  Monte  Carlo 
study  are  presented.  As  indicated,  3000  samples  of  size  30  are  obtained  from  five 
members  of  the  power  family  of  distributions.  Heavy-tailed  distributions,  i/  = .10 
and  u = .20,  the  bivariate  normal  distribution,  u = 1,  and  light-tailed  distributions. 


u = 2.0  and  u = 10.0,  are  included.  Table  3.4  contains  the  results  from  the  second 
Monte  Carlo  study  where  the  underlying  distribution  is  chosen  from  the  bivariate 
family  of  t-distributions  with  degrees  of  freedom  of  1,  3,  10  and  20. 

The  results  of  the  Monte  Carlo  studies  agree  with  those  of  the  previous 
section.  For  the  bivariate  normal  alternatives.  Hotelling’s  T^,  Winp  and  BH  have 
very  similar  powers,  with  the  power  of  Hotelling’s  being  slightly  larger  than 
the  others.  For  the  heavy-tailed  distributions,  Sn  has  a slightly  larger  power  than 
that  of  Winp  and  BH,  with  Hotelling’s  T^’  s power  significantly  lower  (see  Table 
3.3,  case  o(  u = .10).  For  the  light-tailed  distribution,  Winp  and  Hotelling’s 
maintained  a good  performance  while  that  of  BH  and  declined.  In  summary, 
it  can  be  concluded  that  Winp  maintained  a solid  performance  under  the  heavy- 
tailed, light-tailed  and  multivariate  normal  distributions,  while  the  other  tests’ 
performances  declined  in  one  of  the  three  cases.  For  example.  Hotelling’s  didn’t 
perform  well  under  very  heavy-tailed  distributions,  the  performance  of  quadratic 
form,  Sn,  was  not  good  for  light-tailed  cases,  and  the  powers  of  BHn  declined  for 
distribution  with  lighter  tails  than  the  normal  case. 
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In  the  second  Monte  Carlo  study,  the  dimension  was  increased  to  p = 4,  the 
number  of  iterations  to  5000,  and  the  sample  size  to  n = 50.  In  this  study,  the  gen- 
eralization of  BH  to  the  multivariate  case;  that  is  the  test  due  to  Hettmansperger, 
Nyblom,  and  Oja;  was  not  included  because  it  is  very  difficult  to  calculate  the  test 
statistic  for  dimensions  larger  than  two.  Distributions  from  the  power  family  and 
the  family  of  multivariate  t-distributions  are  considered.  The  results  are  presented 
in  Tables  3.5  and  3.6.  For  the  normal  case,  the  three  tests  perform  very  well  with 
a slight  advantage  to  and  Winp  over  Sn-  For  heavy-tailed  distributions  from  the 
power  family,  the  performance  of  Hotelling’s  deteriorates  dramatically,  while  the 
performance  of  both  Wnp  and  Sn  stays  good,  with  Sn  having  the  higher  power.  For 
the  light-tailed  cases  from  the  power  family,  Winp  and  have  similar  powers  that 
are  higher  than  those  of  For  the  extreme  heavy-tailed  cases  from  the  family  of 
t-distributions,  it  is  clear  that  Sn  and  Winp  have  a much  better  performance  than 
T^,  and  as  the  degrees  of  freedom  increases,  the  powers  of  the  three  tests  become 
virtually  the  same.  Once  again,  it  is  demonstrated  that  Winp  maintains  a very  good 
performance  for  all  distributions  considered,  while  the  other  tests  show  a decline  in 
their  performances  in  some  cases. 

While  the  conclusions  from  the  Monte  Carlo  study  in  dimension  4 are  similar 
to  those  from  the  bivariate  case,  it  is  worth  noting  that  the  time  it  took  for  each 
run  of  the  first  Monte  Carlo  study,  that  is  3000  iterations  of  bivariate  samples  of 
size  30,  is  more  than  36  hours  where,  on  the  other  hand,  a run  from  the  second 
Monte  Carlo  study,  that  is  5000  iterations  of  samples  of  size  50  from  quadrivariate 
distributions  took  less  than  5 minutes.  The  reason  of  this  drop  in  computation 
time  is  the  absence  of  the  test  statistic  of  Hettmasnperger  et.  al  from  the  second 
Monte  Carlo  study.  This  difference  in  the  computation  time  also  demonstrates  the 
ease  with  which  one  can  calculate  the  new  test  statistic  for  data  in  any  practical 
dimension. 
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In  conclusion,  the  test  comparisons,  whether  done  by  using  the  Pitman 
asymptotic  relative  efficiencies  or  Monte  Carlo  studies,  demonstrate  that  the  new 
proposed  multivariate  affine-invariant  signed-rank  test  performs  as  well  as,  if  not 
better  than,  other  leading  competitors.  Moreover,  this  performance  is  robust  to  the 
types  of  distributions  considered,  which  are  taken  from  the  power  family  and  from 
the  family  of  multivariate  t-distributions.  Most  important  is  the  fact  that  the  new 
procedure  is  much  easier  to  implement  than  tests  found  in  the  literature,  with  the 
exception  of  Sn,  which,  by  the  way,  is  a special  case  of  Wnp. 
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Table  3.3:  Monte  Carlo  Results  for  the  Bivariate  Power  Family  of  Distributions 


Test  Statistic 

V 

Amount  of  Shift 

.10 

0 

lA 

2A 

3A 

rp2 

0.0267 

0.0627 

0.1947 

0.3697 

^^Inp 

0.0550 

0.3617 

0.7207 

0.8937 

BH 

0.0560 

0.3017 

0.6557 

0.8530 

Sn 

0.0527 

0.5617 

0.8890 

0.9743 

.20 

rji2 

0.0330 

0.1033 

0.3123 

0.5643 

^^Inp 

0.0460 

0.2213 

0.5850 

0.8433 

BH 

0.0433 

0.2080 

0.5753 

0.8423 

Sn 

0.0413 

0.3120 

0.7267 

0.9243 

1.0 

rp2 

0.0480 

0.1500 

0.5100 

0.8773 

^^Inp 

0.0463 

0.1367 

0.4770 

0.8593 

BH 

0.0463 

0.1350 

0.4693 

0.8487 

Sn 

0.0470 

0.1287 

0.4070 

0.7727 

2.0 

rp2 

0.0537 

0.1410 

0.4607 

0.8430 

^^Inp 

0.0463 

0.1247 

0.4127 

0.7940 

BH 

0.0487 

0.1170 

0.3843 

0.7560 

Sn 

0.0500 

0.1027 

0.2993 

0.6207 

10.0 

fji2 

0.0493 

0.1580 

0.4937 

0.8783 

^^Inp 

0.0437 

0.1407 

0.4560 

0.8343 

BH 

0.0450 

0.1203 

0.3910 

0.7660 

Sn 

0.0477 

0.0987 

0.2877 

0.6087 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Power  family. 

.05 

1. 

30. 

2. 


a = 

S 

Sample  size  = 
Dimension  = 


rj}2 

^^Inp 

BH 

Sn 


= Hotelling’s  procedure 

= New  procedure  with  the  linear  0(-)  function. 
= Brown  and  Hettmansperger  (1989)  procedure. 
= Randles  (2000)  procedure. 
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Table  3.4:  Monte  Carlo  Results  for  the  Bivariate  t-Distributions 


Test  Statistic 

df 

Amount  of  Shift 

1 

0 

lA 

2A 

3A 

rp2 

0.0130 

0.0393 

0.1117 

0.2317 

^^Inp 

0.0460 

0.1487 

0.4383 

0.7177 

BH 

0.0453 

0.1420 

0.4260 

0.6980 

Sn 

0.0500 

0.1757 

0.5697 

0.8443 

3 

rp2 

0.0427 

0.1440 

0.4580 

0.7797 

^^Inp 

0.0477 

0.1713 

0.5600 

0.8810 

BH 

0.0503 

0.1753 

0.5737 

0.8917 

Sn 

0.0477 

0.1757 

0.5697 

0.8897 

10 

rp2 

0.0420 

0.1453 

0.4903 

0.8543 

^^Inp 

0.0387 

0.1417 

0.4887 

0.8583 

BH 

0.0360 

0.1463 

0.4873 

0.8567 

Sn 

0.0413 

0.1280 

0.4460 

0.8103 

20 

rp2 

0.0513 

0.1433 

0.4910 

0.8510 

^^Inp 

0.0527 

0.1363 

0.4763 

0.8457 

BH 

0.0513 

0.1420 

0.4730 

0.8443 

Sn 

0.0527 

0.1260 

0.4207 

0.7737 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Multivariate  t-distribution. 
a = .05 

S =1. 

Sample  size  = 30. 

Dimension  = 2. 


fji2 

BH 

Sn 


= Hotelling’s  procedure. 

= New  procedure  with  the  linear  0(-)  function. 
= Brown  and  Hettmansperger  (1989)  procedure. 
= Randles  (2000)  procedure. 
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Table  3.5:  Monte  Carlo  Results  for  the  Multivariate  Power  Family  of  Distributions 


Test  Statistic 

V 

Amount  of  Shift 

.10 

0 

lA 

2A 

3A 

rji2 

0.0280 

0.0450 

0.1194 

0.2504 

^^Inp 

0.0436 

0.1646 

0.4868 

0.7702 

Sn 

0.0426 

0.2214 

0.6116 

0.8796 

.20 

rp2 

0.0380 

0.0922 

0.3058 

0.6142 

^^Inp 

0.0422 

0.1658 

0.5370 

0.8688 

Sn 

0.0402 

0.1988 

0.6350 

0.9266 

1.0 

rp2 

0.0524 

0.1486 

0.5198 

0.8992 

^^Inp 

0.0472 

0.1298 

0.4862 

0.8800 

Sn 

0.0466 

0.1254 

0.4466 

0.8432 

2.0 

rji2 

0.0558 

0.1486 

0.5128 

0.8904 

^^Inp 

0.0470 

0.1270 

0.4678 

0.8570 

Sn 

0.0516 

0.1132 

0.3908 

0.7678 

5.0 

rji2 

0.0510 

0.1436 

0.5272 

0.9050 

^^Inp 

0.0428 

0.1258 

0.4870 

0.8836 

Sn 

0.0472 

0.1046 

0.3704 

0.7664 

Entries:  the  proportion  of  times  out  of  5000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 


Distribution 

= Power  family. 

a 

= .05 

S 

= I. 

Sample  size 

= 50. 

Dimension 

= 4. 

rp2 

= Hotelling’s  procedure 

^^Inp 

= New  procedure  with  the  linear  (p{-)  function. 

Sn 

= Randles  (2000)  procedure. 

4 
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Table  3.6:  Monte  Carlo  Results  for  the  Multivariate  t-Distributions 


Test  Statistic 

df 

Amount  of  Shift 

1 

0 

lA 

2A 

3A 

rjp2 

0.0136 

0.0262 

0.0772 

0.1748 

^^Inp 

0.0430 

0.1218 

0.4382 

0.7630 

Sn 

0.0432 

0.1508 

0.5442 

0.8698 

3 

rp2 

0.0370 

0.0946 

0.3358 

0.6752 

^^Inp 

0.0452 

0.1186 

0.4514 

0.8164 

Sn 

0.0448 

0.1258 

0.4866 

0.8498 

10 

rp2 

0.0488 

0.1300 

0.4640 

0.8346 

•1 

0.0454 

0.1216 

0.4636 

0.8484 

5n 

0.0444 

0.1172 

0.4520 

0.8372 

20 

rjp2 

0.0526 

0.1518 

0.5310 

0.8978 

^^Inp 

0.0446 

0.1428 

0.5122 

0.8872 

Sn 

0.0402 

0.1342 

0.4944 

0.8688 

Entries:  the  proportion  of  times  out  of  5000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Multivariate  t-distribution. 


a 

S 

Sample  size 
Dimension 

rjp2 

^^Inp 

Sn 


.05 

I. 

50. 

4. 

Hotelling’s  procedure 

New  procedure  with  the  linear  </>(•)  function. 
Randles  (2000)  procedure. 


CHAPTER  4 

SIMPLE  REPEATED-MEASURES  DESIGNS 


In  the  remainder  of  this  dissertation,  the  new  proposed  signed-rank  test 
is  applied  to  the  single-factor  repeated-measures  designs.  A literature  review 
following  the  introduction  of  such  designs  is  presented  in  this  chapter. 

To  introduce  single-factor  repeated-measures  designs,  let  Yi , • • • , Y„  be 
iid  as  Y,  where  Y = (Yi,  • • • , Yp)'  is  from  a p-dimensional,  p > 2,  absolutely 
continuous  population.  Each  observation,  Yj,  is  regarded  as  repeated  measures, 
one  observation  for  each  of  the  p treatments,  taken  on  the  experimental  unit 
(subject).  The  treatments  are  assumed  to  be  fixed  while  the  subjects  are  taken  as 
random.  Moreover,  a subject  by  treatment  interaction  is  included  to  the  model. 
Thus,  the  general  mixed  model  with  a subject  by  treatment  interaction  is  used.  In 
vector  form,  the  model  is 

Yj  = 0ilp  -f  T -f  ()9t )i  -|-  e,-,  i = 1,  • • • ,n  (4.1) 

where  Ip  is  the  p x 1 vector  of  I’s,  r = (ri,  • • • , Xp)'  is  the  vector  of  fixed  treatment 
effects,  represents  the  random  effect  of  the  ith  subject,  denotes  the  vector 
of  the  ith  subject  by  treatment  interactions,  and  is  the  vector  of  random  error  of 
the  ith  subject.  The  following  assumptions  are  made.  The  PiS  are  iid  with  mean 
0 and  variance  <t|,  /9r^s  are  iid  with  mean  0 and  a general  variance-covariance 
matrix,  and  the  Cj’s  are  iid  with  mean  0 and  variance-covariance  matrix  cr^Ip, 
where  Ip  is  the  p x p identity  matrix.  Moreover,  the  random  variables  /5f,  /Jr^, 
for  i = 1, . . . , n are  all  mutually  independent.  The  test  of  equal  treatment  effects  is 
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of  interest.  This  test  can  be  stated  as 


Ho  T]_  = • • - = Tp  versus  Ha  : Tj  for  some  i ^ j. 


(4.2) 


Several  procedures  can  be  considered  for  such  designs.  Starting  with  para- 
metric procedures,  two  classical  approaches  are  often  recommended.  The  first 
procedure,  due  to  Hsu  (1938),  is  an  application  of  Hotelling’s  T^.  Hsu  argues 
that  the  test  in  (4.2)  can  be  carried  out  by  applying  Hotelling’s  (described  in 
Chapter  1)  to  the  {p  — l)-variate  random  vectors,  Xj’s,  obtained  from  the  Yj’s  via 


= Xp_i)'  = (Fi  - Fp,  • • • , Fp_i  -Yp)'  i = 1, . • . , n. 


Recall  that  Hotelling’s  procedure  is  affine-invariant,  and  if  the  Y’s  come  from  the 
p-variate  normal  distribution,  will  have  a null  distribution  that  is  a multiple 
of  the  F-distribution  with  p — 1 and  n — {p  — 1)  numerator  and  denominator 
degrees  of  freedom,  respectively.  The  affine-invariance  property  is  important 
because  it  ensures  that  the  performance  of  the  test  is  not  affected  by  the  shape 
of  the  population  variance-covariance  matrix,  as  is  the  case  with  some  of  the 
tests  considered  later.  The  second  procedure  is  the  classical  analysis  of  variance 


(ANOVA)  F test.  This  procedure  uses  the  Y’s  and  the  test  statistic 


F = 


- y-)' 


-X-  V"  ("F  • - F - F • -)-  F 12  ’ 

n-l  2-^i=l  ■'f  ■‘•J  ^ 


(4.3) 


where  F^  denotes  the  jth  element  of  the  vector  Yj,  Y.j  = (1/n)  Yij,  Yi.  = 
(1/p)  Y^j=i  y-  — (1/^P)  XlILi  y^r  When  the  Yj’s  are  (multivariate) - 

normally  distributed,  the  null  distribution  of  the  ANOVA  test  is  an  exact  F- 
distribution  with  p — 1 and  (n  — l)(p  — 1)  degrees  of  freedom  if  and  only  if  the 
population  variance-covariance  matrix  of  Y,  say  S = (cr«/),  satisfies  the  Huynh  and 
Feldt  condition  (see  Huynh  and  Feldt,  1970).  Huynh  and  Feldt  give  three  forms  to 
express  this  necessary  and  sufficient  condition: 
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1.  The  elements  au'  of  the  variance-covariance  matrix  S are  of  the  form: 


att'  — oit  + cxf  -l-  X5tt' 


(4.4) 


where  A > 0 and  Su'  = 1 if  t and  0 otherwise. 

2.  All  possible  differences  Yij  - Yiy  j ^ f are  equally  variable. 

3.  e = 1,  where  e is  a function  of  the  elements  of  S defined  by 


p^icTd  - 


iP  - 1)  (EL:  E 


where  ad  = (1/p)  Yft=i  = (1/p)  Et'=i  = (1/p)  Et=i 

Note  that  s is  the  multiplicative  adjustment  factor  for  the  degrees  of  freedom 
proposed  by  Greenhouse  and  Geisser  (1959).  Any  matrix  whose  elements  satisfy 
(4.4)  is  referred  to  as  a matrix  of  Type  H.  Two  disadvantages  of  using  the  ANOVA 
F test  is  the  dependence  of  its  performance  on  the  structure  of  the  variance- 
covariance  matrix,  and,  unlike  Hotelling’s  T^,  its  lack  of  the  affine-invariance 
property.  However,  if  the  underlying  distribution  is  p-variate  normal  with  H- 
type  variance  covariance  matrix,  the  F test  will  have  a greater  power  than  the 
Hotelling-Hsu  test  for  the  same  alternatives  (see  Morrison,  1972). 

Morrison  (1972)  developed  several  statistics  to  test  (4.2)  under  various 
assumptions  on  the  variance-covariance  matrix  S.  The  statistics  included  the 
usual  Hotelling-Hsu  statistic  and  the  classical  F test  as  special  cases.  Morrison 
also  measured  the  efficacies  of  the  tests  in  terms  of  the  average  squared  lengths  of 
their  simultaneous  confidence  intervals.  He  demonstrated  that  his  tests  have  higher 
efficacies  than  that  of  Hotelling-Hsu  T^.  These  differences  in  efficacies  decrease 
when  the  sample  size  increases,  and  do  increase  as  the  dimension  increases.  Yet, 
Morrison  only  considered  the  case  where  the  observations  come  from  a multivariate 
normal  distribution. 


Nonparametric  competitors  to  Hotelling’s  appeared  as  early  as  1937  and 
1939  when  Friedman  developed  his  well  known  method  of  ranks  procedure  and 
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Kendall  and  Babington  proposed  an  equivalent  test,  respectively.  Assuming  no  ties 
between  observations  within  a subject,  Friedman’s  test  statistic  has  the  form 


where  Rij  denotes  the  rank  of  Yij  among  Fji,  • • • ,Yip.  If  the  components  of  each 
observation  vector  are  exchangeable,  then  T is  distribution-free.  Under  this  Hg, 
T has  an  asymptotic  chi-square  distribution  with  {p  — 1)  degrees  of  freedom.  An 
alternative  approximation  of  Friedman’s  test  uses  the  statistic 


(n  - 1)T 
n{p  — 1)  — T’ 


which  is  then  compared  with  the  usual  F distribution  with  (p  — 1)  and  (n  — l)(p— 1) 
numerator  and  denominator  degrees  of  freedom,  respectively.  Jensen  (1977)  gave 
error  bounds  on  the  large  sample  approximation  of  Fr.  Iman  and  Davenport 
(1980)  showed  that  the  chi-square  large  sample  approximation  can  be  inaccurate 
and  recommended  using  the  F approximation.  In  their  paper,  Iman  and  Davenport 
also  proposed  two  new  approximations  of  Friedman’s  test.  The  first  approximation 
is  a linear  combination  of  the  T and  Fr,  while  the  second  approximations  is 
obtained  by  using  the  ranks,  Rij,  to  estimate  the  degrees  of  freedom  for  Fr.  It  is 
important  to  note  that  Friedman’s  procedure  only  uses  within-subject  information 
and  ignores  between-subjects  information. 

In  order  to  reduce  the  effect  of  subject-to-subject  variation,  Koch  (1969) 
proposed  using  a test  statistic  based  on  the  ranks  of  the  aligned  observations. 

These  observations  are  obtained  by  subtracting  the  subject  average  from  each 
component  of  that  subject’s  response  vector.  After  this  centering,  the  entire  set  of 
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the  np  observations  is  ranked.  Koch  aligned-ranks  test  statistic  has  the  form 


KAR  = 


n 


n(p-l)  Z^i=l  ^i  ) 


2’ 


where  Rfj  = rank  of  {Yij  — Fj.)  among  Yn  — Fi.,  • • • , Ynp  — Yn.,  R?j  = 

^ ZliLi  - p Z)i=i  If  Yi,  • • • , Y„  are  iid,  then  under  the  null 

hypothesis  of  the  exchangeability  of  the  components  of  Yj,  the  statistic  KAR  has 
an  asymptotic  chi-square  distribution  with  (p-1)  degrees  of  freedom. 

In  an  attempt  to  retrieve  some  of  the  between-subjects  information,  Quade 
(1979)  presented  a class  of  procedures  of  weighted  within-subject  rankings.  The 
class  contains,  as  a special  case,  a p-sample  extension  of  Wilcoxon  (1945)  signed- 
rank  test.  The  idea  was  to  give  larger  weights  to  subjects  that  show  the  largest 
variability  among  the  treatment  responses  for  that  subject.  Thus,  if  Rij  is  the  rank 
of  Yij  among  Yu,  • • • , Di  = D(Yj),  a location-free  statistic  that  measures  the 
variability  within  the  components  of  F,  and  Qi  the  rank  of  Di  among  Di,-  - - , 
then  Quade’s  procedure  has  the  form; 

Wo  = 9r.(n+l)(p+l) 

^ np(n -f  l)(p -l- l)(2n -I- 1)  2(2n-|-l)  ’ 

where 

p / n 
j=l  \i=l 

Under  the  null  hypothesis  of  the  exchangeability  of  the  components  of  the  Yi,  Wq 
is  strictly  distribution-free,  and  has  an  asymptotic  chi-square  distribution  with 
(p  — 1)  degrees  of  freedom.  In  a Monte  Carlo  study,  Silva  (1977)  not  only  found  im- 
proved efficiencies  of  the  weighted-rankings  over  unweighted  but  also  demonstrated 
that  the  choice  of  the  measure  D had  a trivial  effect  on  the  performance  of  Wq  . 

In  an  attempt  to  find  natural  nonparametric  analogs  for  Hotelling’s  and 
the  ANOVA  F test,  Iman,  Hora  and  Conover  (1984)  and  Agresti  and  Pendergast 
(1986)  used  the  rank  transformation  approach.  This  approach  consists  of  ranking 
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all  observations  from  1 to  np  and  replacing  the  data  with  their  corresponding  ranks 
in  calculating  the  parametric  tests. 

Iman,  Hora  and  Conover  (1984)  proposed  computing  the  ANOVA  F test  using 
these  ranks.  That  is  the  test  statistic  is  defined  as 


* \2 


— 


nT,U{R',-R:) 


^ Efei  - K - R'i  + *.)"' 


(4.6) 


where  denotes  the  rank  of  Yij  among  Fxi»  • • • > ^np>  F*j  = (V^)  XliLi  ^iji 
Rl  = (1/p)  R*ip  and  Rl  = (1/np)  R*ij-  In^^n,  Hora  and  Conover 

considered  a null  hypothesis  of  the  form 


Ho  : Fij  = Fj  for  i = 1,  • • • , n and  j = 1,  • • • ,p, 

where  Fij  is  the  continuous  distribution  function  of  the  random  variable  Yij.  Under 
the  assumption  that  the  Vfj’s  are  mutually  independent  and  for  some  regularity 
conditions  on  Fj,  Iman,  Hora,  and  Conover  showed  that  the  null  asymptotic 
distribution  of  (p  - l)Fin’  is  that  of  a chi-square  distribution  with  (p  - 1)  degrees 
of  freedom.  Note  also  that  conditioned  on  the  ranks  given  to  each  subject,  F^kt 
is  distribution-free.  Iman,  Hora,  and  Conover  (1984),  and  later  Hora,  and  Iman 
(1988)  compared  the  rank  transformation  test  to  the  ANOVA  F test,  Friedman’s 
test,  Quade’s  weighted-ranking  test,  and  the  aligned-rank  test,  through  Monte 
Carlo  Studies  and  ARE  calculations.  They  concluded  that,  for  data  from  the 
normal  distribution,  all  the  test  procedures  discussed  had  higher  powers  compared 
to  Friedman’s  test,  but  these  differences  in  power  decreased  as  the  dimension 
increased.  Data  generated  from  the  lognormal  distributions  showed  the  superiority 
of  nonparamteric  tests,  with  the  rank  transformed  test  having  the  highest  power, 
over  the  ANOVA  F test.  This  finding  is  due  to  the  fact  that  lognomal  distributions 
produce  outliers.  For  light-tailed  cases,  the  ANOVA  F test  and  Quade’s  test  had 


the  highest  powers,  while,  for  heavy-tailed  cases,  Friedman’s  test  and  the  rank 
transformation  test  were  the  best  to  use. 
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Agresti  and  Pendergast  (1986)  suggested  using  this  single  overall  ranking 
of  the  observations  and  then  computing  the  Hotelling-Hsu  statistic  using  these 
ranks.  They  considered  the  case  where  the  null  hypothesis  of  no  treatment  effects 
is  expressed  as  Fi  = F2  = • • • = Fp,  where  Fi,  • • • , Fp  are  the  one  dimensional 
marginals  distributions  of  Y = (Yi,  • • • , V^).  If  R-j  is  defined  as  the  rank  of  Yij 
among  Yu,  • • • , Ynp,  then  the  rank  transformation  version  of  the  Hotelling-Hsu 
statistic  is  of  the  form 

= n{BR*y{BSB')-\BR*), 

where 

1-1  0 •••0  0 
0 1 -1  •••  0 0 


0 0 0 •••  1 -1 

R*  = R-l » R-i  = (Rii,  • • • 1 Rip)'i  and  S = (sot),  where  Sab  = “ 

R*g){Rlb  — R*i,)/{n  — p + 1).  If  the  vector  R*  is  normally  distributed,  the  null 
asymptotic  distribution  is  chi-square  with  (p  — 1)  degrees  of  freedom.  Agresti 
and  Pendergast  showed,  through  Monte  Carlo  results,  that  it  was  reasonable  to 
approximate  the  null  distribution  of  T^/{p  — 1)  as  an  F distribution  with  {p  — 1) 
numerator  degrees  of  freedom  and  n — (p  — 1)  denominator  degrees  of  freedom. 

Agresti  and  Pendergast  (1986)  proposed  another  statistic  that  made  use  of 
this  joint  ranking.  The  null  hypothesis  was  considered  as  the  exchangeability  of  the 
components  of  the  Yj.  Their  statistic  included  Koch’s  statistic  and  the  ANOVA  F 
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rank  transformation  statistic  as  special  cases  and  is  based  on 


AP  = 


0-2(1  - p) 


where  i?*-  = n P = Corr(i?^,  j ^ f,  and  = Var(i?^). 

They  argued  that  if  the  vector  of  the  the  first  (p-1)  treatment  averages  is  normally 
distributed,  the  null  asymptotic  distribution  of  AP  is  chi-square  with  {p  — 1) 
degrees  of  freedom.  Kepner  and  Robinson  (1988)  not  only  gave  suflScient  conditions 
under  which  the  vector  {R\,  • • • , R*i)  is  normally  distributed  but  also  suggested 
a suitable  estimate  of  o-^(l  — p)  which  can  be  used  in  order  to  find  conditional 
but  exact  p-values  of  the  test  for  small  sample  sizes.  Moreover,  Ernst  and  Kepner 
(1993)  investigated,  through  a Monte  Carlo  study,  the  performance  of  Koch’s  test, 
the  rank  transformation  statistic  (4.6),  Friedman’s  test  and  the  ANOVA  F test. 
They  observed  that  the  Koch’s  statistic  was  better  compared  to  its  chi-square 
limiting  distribution  as  opposed  to  its  F approximating  limiting  distribution  and 
the  converse  holds  for  the  rank  transformation  ANOVA  F test. 

Jan  and  Randles  (1996)  proposed  applying  the  one-sample  interdirection  sign 
test  of  Randles  (1989)  and  the  one-sample  interdirection  signed-rank  test  of  Peters 
and  Randles  (1990)  to  this  repeated  measures  design  problem.  They  formed  the 
(p-1)  vectors  — (A"ji,  * * * , A”^p_i)  — (F^i  * * * , p—i  F^p)  and  then 

computed  the  interdirection  sign  and  signed-rank  statistics  on  the  transformed 
observations.  The  test  based  on  the  sign  test  has  the  form 


RIS  = 


p-1 


n p 


n 


cos(Trpifc), 


i=i  i=i 


Pik 


Cik  dfi 

= 0 a i = k, 


if  i ^ k, 


where 
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~ I [(0-2)  ~ C-Dl’  denotes  the  interdirection  of  the  vectors  Xj,  X^. 

The  form  of  the  second  test,  which  will  be  denoted  hy  JRn,  can  be  easily  found 
from  (1.3).  If  the  Yf’s  are  iid  from  a p-variate  elliptically  symmetric  distribution 
then  the  null  limiting  distribution  of  RIS  and  also  of  JRn  is  chi-square  with  {p  — 1) 
degrees  of  freedom. 

In  the  next  chapter,  the  sign  test  of  Randles  (2000)  and  the  new  signed-rank 
test  defined  in  Chapter  2 are  applied  to  the  simple  repeated-measure  setting. 

The  null  asymptotic  distributions  of  the  sign  test  and  the  signed-rank  test  are 
investigated  under  the  assumption  that  the  observations  come  from  a directionally 
symmetric  distribution  and  a symmetric  distribution,  respectively.  In  Chapter 
6,  the  asymptotic  distributions  of  these  tests  are  obtained  for  a sequence  of 
alternatives  approaching  the  null.  These  limiting  distributions  are  then  used  to 
find  the  Pitman  asymptotic  relative  efficiencies  of  the  tests.  Comparisons  between 
these  tests  and  Jan  and  Randles  (1996)  signed-rank  test  is  done  based  on  the  ARE 
findings.  Also,  in  Chapter  6,  comprehensive  Monte  Carlo  studies  are  presented. 
These  Monte  Carlo  studies  include  most  of  the  tests  described  in  the  literature 
review  and  are  aimed  on  comparing  all  these  procedures  and  the  new  procedures 
discussed  in  Chapter  5.  Finally,  general  comments  and  recommendations  based  on 
this  dissertation  are  given. 


CHAPTER  5 

MULTIVARIATE  SIGN  AND  SIGNED-RANK  TESTS  APPLIED  TO  THE 

SIMPLE  REPEATED-MEASURES  DESIGNS 

5.1  Introduction 

Simple  repeated-measures  designs  have  experimental  settings  in  which  n 
subjects  are  measured  on  p occasions  with  no  replication.  Moreover,  the  subjects 
are  treated  as  random,  while  the  repeated  factor  is  considered  fixed.  Letting 
Yi,  • • • , Y„,  be  iid  as  Y,  where  Y = (Yi,  • • • , Yp)'  is  from  a p-dimensional,  p > 2, 
absolutely  continuous  population,  this  mixed  model  can  be  formulated  as 

Yi  = 0ilp  + T + {0T)i  -I-  €i,  i = 1,  • • • , n (5.1) 

where  Ip  is  the  p x 1 vector  of  I’s,  r = (ti,  • • • , Xp)'  is  the  vector  of  fixed  treatment 
effects,  Pi  represents  the  random  effect  of  the  ith  subject,  0Ti  denotes  the  vector 
of  the  zth  subject  by  treatment  interactions,  and  Ci  is  the  vector  of  random  error 
of  the  zth  subject.  It  is  assumed  that  the  Pi's  are  iid  with  mean  0 and  variance 
(t1,  PTi'S  are  iid  with  mean  0 and  a general  variance-covariance  matrix,  and  the 
€j’s  are  iid  with  mean  0 and  variance-covariance  matrix  cr^Ip,  where  Ip  is  the  p x p 
identity  matrix.  Moreover,  the  random  variables  Pi,  Pvi,  ei  for  i = 1, . . . , n are  all 
mutually  independent.  The  test  of  equal  treatment  effects. 

Ho  Ti  = • • • = Tp versus  Ha '■  Ti  ^ Tj  for  some  i ^ j,  (5.2) 

is  of  interest. 


53 


To  apply  the  sign  test  of  Randles  (2000)  and  the  new  signed-rank  test  to 
this  repeated-measures  setting,  the  connection  between  this  problem  and  the 
one-sample  location  problem  is  first  established. 

Consider  the  following  transformation: 


Xj  = (Xji,  • • • ,Xi  p-i)'  — {Yii  — Yip,  • • • ,Yi  p_i  — Yip)'  — CYj,  i — 1,  • • • , n,  (5.3) 


where  C is  the  {p  — 1)  x p matrix, 


Under  this  transformation,  the  location  parameter  of  the  X’s  is 


e = Cr  = [ri-Tp,--  - , Tp_i  - Tpj. 

Hence,  using  the  p-variate  Y’s  to  test  (5.2)  is  transformed  to  using  the  {p  — 1)- 
variate  X’s  to  test 


Ho'.  9 = 0 versus  Ha  - 9 ^0, 
which  is  nothing  but  a one-sample  location  problem. 

5.2  The  Test  Statistics 

In  order  to  compute  the  quadratic  form,  Sn,  due  to  Randles  (2000),  and 
the  new  signed-rank  statistic,  W„  p_i,  on  the  transformed  vectors,  let  Ax  be  the 
(p  — 1)  X (p  — 1)  nonsingular  matrix  that  is  computed  using  the  X’s  and  that 
satisfies 

1^/  AxXi  ^ ( AxXi  V_  1 ^ 

n^\^l|AxX,||J  l,l|AxX,||; 
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A A ^ 

Moreover,  let  Qi  be  the  rank  of  ||AxXt||  among  ||AxXi||,  • • • , ||AxX„  | and  define 


V,-  = 


AxXj 

AxXj 


ysi  = <!> 


Qi 


n + 1 


V 

^ 2? 


where  (f>{-)  is  a nonnegative,  nondecreasing,  uniformly  bounded  continuous  function 
that  may  depend  on  the  dimension  p — 1.  Note  that,  for  efficiency  calculations  and 
Monte  Carlo  studies,  the  </>{•)  function  defined  in  (2.6)  will  be  used.  The  quadratic 


forms  are  then 


Sn  = n{p  - 1)V'V, 


(5.5) 


and 


n 


w„  = nV;(n-‘  Y, 


(5.6) 


i=l 


where 


1 " _ 1 " 

v=-y'V(,  v.  = -y'v,i, 

"tr  "In 

and  Ho  is  rejected  for  large  values.  The  test  statistic  is  distribution-free, 
under  Ho  for  the  class  of  distribution  with  elliptical  directions  and  conditionally 
distribution-free  for  the  broader  class  of  directionally  symmetric  distributions.  The 
quadratic  form  Wn  p-\  is  conditionally  distribution-free  for  the  class  of  symmetric 
distributions.  Moreover,  as  mentioned  earlier,  both  test  statistics  are  affine- 
invariant  (with  respect  to  the  X's)  and  thus  their  performances  do  not  depend  on 
the  structure  of  the  underlying  population  variance-covariance  matrix  as  in  the 
case  of  some  of  the  tests  discussed  in  chapter  4.  Note  that  the  particular  contrast 
scheme  in  (5.3)  is  used  without  loss  of  generality.  In  fact,  if  D is  any  (p— 1)  x (p— 1) 
nonsingular  matrix,  then  using  the  transformation  DX  will  produce  exactly  the 
same  value  for  the  test  statistic,  since  and  Wn  p~i  are  affine-invariant.  For 
example,  many  authors  use  the  scheme: 


Xt  = (XA,  • • • , Xr_,y  = {Ya  - Yi2,  Yi2  - Fis,  • • • , p-i  - Yip)'  = C*Y.  (5.7) 
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The  matrix  C*  in  (5.7)  can  be  written  as  DC,  where  D is  the  (p  — 1)  x {p  — 1) 
nonsingular  matrix 

1-1  0 0 •••  0 0 

0 1 -1  0 •••  0 0 

• • • •••••  • ^ 

0 0 0 0 •••  1 -1 

0 0 0 0 •••  0 1 _ 

thus,  using  the  contrast  scheme  in  (5.3)  or  the  one  in  (5.7)  will  result  in  the  same 
value  of  the  test  statistic. 

5.3  Asymptotic  Null  Distribution 

The  following  two  lemmas  are  important  in  the  development  of  the  null 
limiting  distributions  of  the  test  statistics. 

Lemma  5.3.1.  If  Yi,  •••  , Y„  are  iid  from  a symmetric  distribution  around  t, 
then  for  any  fixed  matrix  C,  the  vectors  Xf  = CYj,  i = 1,  • • • , n,  are  iid  symmetric 
around  0 = Cr. 

Proof  It  is  clear  that  the  Xj’s  are  iid  since  the  Yi’s  are.  Moreover,  since  the  Yf 
are  symmetric  around  r,  then 

(Yi-r)  4 -(Yi-r). 

Using  the  transformation  H{u)  = Cu  results  in 

Xi-e  = CYi-CT  = H{Yi-T)  = H{T-Yi) 

= Cr-CYi 

= 0-Xi. 


Thus,  the  Xj’s  are  symmetric  around  0. 


□ 
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Note  that,  for  the  simple  repeated-measures  setting  considered,  r = rl',  C is 
given  in  (5.4),  and  0 = 0. 

Lemma  5.3.2.  //Yi,  • • • , Y„  are  iid  from  a directionally  symmetric  distribution, 
then  for  any  fixed  matrix  C,  the  vectors  Xi  = CY*,  i = !,•••  ,n,  are  also  iid  from  a 
directionally  symmetric  distribution. 

Proof  It  is  clear  that  the  Xj’s  are  iid  since  the  Yj’s  are.  Moreover, 


since  the  Yj’s  come  from  a directionally  symmetric  distribution.  Using  the  trans- 
formation H{u)  = Cu,  results  in 


CYi 

IIYill 

Applying  the  transformation  •0(v)  = v/ 


d 


CYi 

iiYiir 

, ||v||  ^ 0 gives 


Thus,  the  Xj-’s  have  a distribution  that  is  directional  symmetric.  □ 

The  null  asymptotic  distributions  of  Sn  and  p_i  are  discussed  in  the 
following  two  theorems. 

Theorem  5.3.1.  If  Yi,-  • • , Y„  are  iid  from  a directionally  symmetric  distribution, 
then,  under  Ho,  the  quadratic  form,  Sn,  defined  in  (5.5)  has  limiting  chi-square 
distribution  with  p — 1 degrees  of  freedom. 


Proof.  The  proof  follows  readily  from  lemma  5.3.1  and  theorem  1 of  Randles 
(2000). 
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□ 

Theorem  5.3.2.  IfYi,  • • • , Y„  are  iid  from  a symmetric  distribution,  then,  under 
Ho,  the  quadratic  form,  Wn  p-i,  defined  in  (5.6)  has  limiting  chi-square  distribution 
with  p — 1 degrees  of  freedom. 

Proof.  The  proof  follows  readily  from  lemma  5.3.2  and  theorem  2.4.1  in  chapter 

2.  □ 


CHAPTER  6 
TEST  COMPARISONS 


6.1  Asymptotic  Distribution  Under  Contiguous  Alternatives 
In  the  first  set  of  test  comparisons,  the  test  statistics  5„  and  Wn  p~i,  defined 
in  (5.5)  and  (5.6)  respectively,  along  with  PRn,  which  is  the  test  of  Peters  and 
Randles  applied  to  the  transformed  observations  (see  Jan  and  Randles,  1996),  are 
compared  using  their  corresponding  Pitman  asymptotic  relative  efficiencies  relative 
to  Hotelling-Hsu  T^. 

The  computation  of  the  ARE’s  of  these  tests  requires  finding  their  corre- 
sponding asymptotic  distributions  under  contiguous  alternatives.  These  derivations 
are  done  assuming  that  the  Y’s  come  from  the  class  of  elliptically  symmetric 
distributions.  Thus,  assume  that  the  Y’s  come  from  a distribution  of  the  form 


UMy)  = Kp\i:\-h{{y  - r)'S-i(y  - r)}. 


(6.1) 


where  Kp  > 0,  and  S is  proportional  to  the  covariance  matrix  of  Y . The  following 
lemma,  which  is  given  as  an  exercise  in  Fang,  Kotz  and  Ng’s  (1990)  book,  is  stated. 
Theorem  6.1.1.  IfY  has  ap-variate  elliptically  symmetric  distribution  with 
parameters  t and  S,  then  X = CY,  defined  in  (5.3),  has  a (jp  — l)-variate 
elliptically  symmetric  distribution  with  parameters  Cr  and  CSC'. 

Proof,  see  Theorem  2.16  of  Fang,  Kotz,  and  Ng,  1990.  n 

Before  investigating  the  asymptotic  distributions  of  and  p_i  under 
contiguous  alternatives,  the  density  function  of  the  transformed  observations,  X’s, 
as  well  as  that  oi  R = X'X  are  obtained. 
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Theorem  6.1.2.  IfY  has  a density  function  given  by  (6.1),  then  X,  defined  in 
(5.3),  has  a density  function  of  the  form 
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/ct,csc'(X)  — Xp|CSC^|  — Ct)^(CSC^)  ^(X  Ct)}, 


where 


h' 


/+CX5 

-oo 


h{t  + s^)ds. 


(6.2) 


Proof.  This  proof  is  the  same  as  the  proof  given  by  Jan  (1991)  but  for  the  general 
case  of  all  elliptically  symmetric  distribution.  Consider  the  transformation, 


z = 


2/p 


yi  -Vp 


2/p-i  Vp 

Up 


1 


-1 


1 


p 

By, 


where  C is  given  in  (5.4),  and  Cp  = (0,  • • • ,0, 1)'.  Note  that  B is  nonsingular  and 


IBl  = 1,  thus  y = B ^z,  and 


/z(z)  = /y(y)|y=B-iz 

= i^p|S|-^{(z  - Br)'(BEB')“^(z  - Br)} 

= Kp\B'EB'\~h{{z  - Bt)'(BSB')”Hz  “ Br)} 
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The  last  equality  is  due  to  the  fact  that  |BSB'|  = 1B||S||B' 
Now,  the  density  function  of  X is 

'-1-00 


SI  since  B 


/-t-oo 

/z(z)dyp 

-OO 


where 


i^plBSB 


+00 


h{{z  - Br)'(BSB')"Hz  ~ Br)}d?/p 


00 


= K, 


1 r^°° 

|A|-^  / 

J — OO 


h{(z-/i)'(A)  ^{z-fi)}dyp, 


A = BSB'  and  fx  = Br. 


The  matrix  A and  the  vector  //  can  be  also  written  as 


A = BEB'  = 


CSC' 

CScp 

All 

Ai2 

e;SC' 

A21 

A22 

and 


fx  = Br  = 


Cr 

0 

6^  T 

1 

1 

The  inverse  of  A has  the  following  form: 

-1 

Ai  1 Aio 

A-'  = 


All  Ai2 
A21  A22 


All  All  A12A22.1A21A1I  — A^j  A12A22.1 

~A22-lAi2Aii  A22.1 


A22-1  — A22  ~ A21A11  A\2- 


= 1. 


(6.3) 


where 
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Also,  the  determinant  of  A is  such  that 


— I All  I • IA22 


A2iAi]^Ai2|  = |Aii|  • IA22.1I- 


Thus,  the  quadratic  form  in  (6.3)  can  be  written  as 

(z  - /i)'A“^(z  - /x) 

= (x  - 0)'A{]^(x  - 0)  + (x  - 0)'A;"/Ai2AJ2\A2iA^j^(x  - 0) 

HVp  - - 2(x  - 0)'Ai-/Ai2A22a(?/p  - Tp) 

= (x  - 0)'An  (x  -0)  + [(yp  - Tp) A22^f  - (x  - 0)'An^Ai2A22^f]^, 


and 


/x(x)  = Ap|Aii|  • IA22.1I 

^+00  , , . . (6.4) 

I h I (x  - 0)' Afi^  (x  - 0)  + [{yp  - Tp) A22^.f  - (x  - 0)'An^  A^i2^22-i  ? } dyp 


— 00 


Taking  s = {yp-  Tp)A22^^  ~ (x  - 0)'Au^Ai2A22^i^,  expression  (6.4)  becomes 

/+00 

h{{yi-0)'An^{x-0)  + s^}ds. 

■00 

This  completes  the  proof,  since  An  = CSC'  and  0 = Cr. 


□ 


Corollary  6.1.1.  IfY  has  a density  function  given  by  (6.1),  and  CSC'  = Ip_i, 
then,  under  Hg,  R = X'X  has  a density  function  of  the  form 


0 < r < 00, 


where  h*{-)  is  as  defined  in  (6.2). 

Proof.  The  proof  is  a direct  application  of  theorem  2.9,  p.  35  of  Fang,  Kotz  and 
Ng’s  1990  book.  (Also,  see  Hossjer  and  Croux,  1995.)  □ 
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Now,  consider  the  sequence  of  alternatives  where  Xi,  • • • ,X„  are  iid  with 
density  /x(x  — where  /x(-)  is  given  in  (6.4),  and  c is  a non-zero  (p  — 1)- 
dimensional  vector. 

The  following  two  theorems  characterize  the  limiting  distribution  of  Wn  p-i 
and  Sn  under  the  sequence  of  alternatives  approaching  the  null. 

Theorem  6.1.3.  //Yi,  • • • , Y„  are  iid  from  an  elliptically  symmetric  distribution 
and  CSC'  = Ip_i,  then,  under  the  sequence  of  alternatives  approaching  the  null 
hypothesis,  the  limiting  distribution  of  Wn  p-i  is  non-central  chi-square  with  p — 1 
degrees  of  freedom  and  non- centrality  parameter 


Proof  When  CSC'  = Ip_i  and  under  the  null  hypothesis,  the  transformed 
observations  are  iid  from  an  elliptically  symmetric  distribution  with  a density 
function  of  the  form 


Therefore,  the  proof  follows  from  applying  theorem  3.1.1  to  the  transformed 
sample.  ^ 

Theorem  6.1.4.  //Yi,  • • • , Y„  are  iid  from  an  elliptically  symmetric  distribution 
and  CSC'  = Ip_i,  then  under  the  sequence  of  alternatives  approaching  the  null 
hypothesis,  the  limiting  distribution  of  Sn  is  in  non-central  chi-square  with  p — 1 
degrees  of  freedom  and  non- centrality  parameter 


(6.5) 


where  Rf  = X'X,  h*'{R^)  = and  G{t)  = PhM^  <t),  t>  0. 


/x(x)  = Kph*{x'x). 


(6.6) 


= and  G{t)  = PhM^  <t),t>  0. 


where  R?  = X'X  and  h*'  {R?) 
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Proof.  The  proof  follows  from  the  fact  that  can  be  obtained  from  Wn  p_i  by 
setting  (j){u)  = 1. 


□ 


Note  that  the  assumption  CSC'  = Ip_i  is  used  without  loss  of  generality  since 
the  test  statistics  involved  are  affine-invariant. 


6.2  Asymptotic  Relative  Efficiencies 
The  Pitman  Asymptotic  Relative  Efficiency  between  two  test  statistics  is 
defined  in  Chapter  3.  In  this  section,  the  ARE  of  Wn  p_i  and  that  of  5„,  relative 
to  Hotelling-Hsu  are  obtained.  Two  special  families  from  the  class  of  elliptically 
symmetric  distributions  are  considered,  the  power  family  and  the  multivariate  t- 
distributions  family.  These  two  families  are  described  in  Chapter  3.  As  in  Chapter 
3,  the  efficiencies  obtained  are  ratios  of  non-centrality  parameters  of  chi-square 
limiting  distributions. 

Theorem  6.2.1.  If  Yi,  • • • , Y„  are  iid  from  the  power  family  of  distributions,  then 
the  Pitman  asymptotic  relative  efficiency  of  Wn  p-i  with  respect  to  Hotelling-Hsu 


ts 


77i2 

4 


ARE(W.,  ,-ur)  = (—) 


E„J4'HG(R))] 


where 


E 


H, 


-"-Iwi  f ‘“'i/ 


+ 00 


oo 


( 2 , 

[r  -h  s ) e '■  <=o  ' 


ds 


dr, 


and 


G{r)  = Kp 


27tV  rr  r+oo  ,.2. .2 


kLL 


^ ^'^ds  dt. 


(6.7) 


(6.8) 


Proof.  Without  loss  of  generality,  let  CSC'  = Ip-i.  Result  (6.7)  follows  immedi* 


(6.9) 


ately  from  Theorem  6.1.3  and  Theorem  3.2.1.  To  prove  (6.8),  recall  that  for  the 
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power  family, 


Mt)  = and.  K,  = 

where  Cq  is  given  in  (3.3),  thus  h*{t),  defined  in  (6.2),  is  such  that 


K 


/+00  t + 

exp{-(— : — )"}<^«> 


—oo 


and 


dt  <7-00 


ds. 


(see  Rudin,  1976,  Theorem  9.42,  p.236).  Moreover,  by  corollary  6.1.1, 

9n(r)  = (’■'). 

thus 


E 


He 


■ h*'{R^)  . 


r 

Jo 


2ir^ 

f(^) 


ffR(r)dr 
Jo^°°  <f>(G(r)y-^h*'(r^)dr. 


Replacing  h*  (r^)  by  its  corresponding  expression,  (6.8)  is  established.  Expression 
(6.9)  is  easily  established  by  replacing  p(-)  by  its  corresponding  expression  in 


G(r)  = [ g{t)dt 
Jo 


□ 


Theorem  6.2.2.  IfYi,---  ,Y„  are  iid  from  the  power  family  of  distributions,  then 
the  Pitman  asymptotic  relative  efficiency  of  Sn  with  respect  to  Hotelling-Hsu  is 


ARE{Sr,,T^)  = (^)^H, 


R 


h*'{R^) 

h*(R^) 


(6.10) 
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where 


E 


He 


R 


h*'{R^) 

h*{R?)_ 


ds 


dr. 


Proof.  Without  loss  of  generality,  let  CSC'  = Ip_i.  Thus,  the  proof  follows  by 
setting  (f){u)  = 1 when  applying  theorem  6.2.1. 


□ 


Theorem  6.2.3.  If  Yi,  - • • , Y„  are  iid  from  the  family  of  multivariate  t- 
distributions,  then  the  Pitman  asymptotic  relative  efficiency  ofWn  p-i  with  respect 


to  Hotelling-Hsu  is 


ARE{Wn  = { 


41/ 


) 


Eh. 


(p-l)(i/-l)'  E„U^(G(R)) 


(6.11) 


where 


E 


Ho 


-K. 


{p  + j>')7r  2 


" ^r( 


/)7T  2 r°°  , f+°° 

o / 0 J —00 


1 + 


r^  + 


u 


P+I/-H2 

2 


ds 


dr, 


and 


E— 1 or  p+oo 


G(r)  = K^j;i 


( 1 + 


r(2: 


P + s- 


_E±*1 


ds  dt. 


— OO 


(6.12) 


(6.13) 


Proof.  Without  loss  of  generality,  let  CSC'  = Ip_i.  Expression  (6.11)  follows  from 
theorem  6.1.3  and  theorem  3.2.2.  To  establish  (6.12),  recall  that  for  multivariate 


t-distributions. 


h(()  = (1  + lr‘^,  and,  K,  = 


Therefore,  h*{t),  defined  in  (6.2),  is  such  that 


h' 


/+0 
■OO 


-f  00  / -/•  I «2  \ 2 


1 + 


t + S' 


V 


ds, 


and 


dh*  (t) 
dt 


2l^  7-00 


_p+u-\-2 

t + s^\  2 

1 H I ds. 
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The  density  function  of  R becomes 


p-i 


+°°  / t + s 
(l  + 

— OO  \ 


o\ 

2 \ 2 


ds. 


V 


(6.14) 


Hence, 


E 


H, 


/»+00 


i ( 2 ^ •'0 


which,  by  replacing  h*'{r^)  by  its  value,  gives  (6.12).  Finally,  (6.13)  is  obtained  by 


integrating  expression  (6.14)  over  [0,  r]. 


Theorem  6.2.4.  If  Yi,  • • • , Y„  are  iid  from  the  family  of  multivariate  t- 
distributions,  then  the  Pitman  asymptotic  relative  efficiency  of  Sn  with  respect 
to  Hotelling-Hsu  is 


ARE{Sn,T^)  = ( 


4u 


(p-  l)(i/-  1) 


)El 


R 


h*'{R^) 

h*(R'^)_ 


(6.15) 


where 


E 


H, 


R 


h*'  (i?2) 
h*{R?)_ 


= -K. 


(p  + I/)7T  2 


" uVm^)  Jo 


poo  /»4-oo 

/ / 

Jo  J — OO 


1 + 


2 

^2  _|_  ^2  \ 2 


ds 


dr. 


u 


Proof.  This  theorem  is  a special  case  of  theorem  6.2.3,  where  (/»(•)  = 1. 


□ 


The  asymptotic  relative  efficiencies  of  the  tests  obtained  by  applying  the 
new  signed-rank  test  with  the  linear  </>(•)  function,  Winp,  the  sign  test  of  Randles 
(2000),  Sn,  and  the  signed-rank  test  of  Peters  and  Randles  (1990),  PRn,  to  the 
repeated-measure  designs  are  presented  in  tables  6.1  and  6.2.  Table  6.1  displays 
the  efficiencies  for  the  cases  when  the  observations  come  from  the  power  family  of 
distributions,  while  Table  6.2  holds  the  results  for  the  cases  when  the  observations 
come  from  the  family  of  multivariate  t-distributions. 
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Table  6.1:  Asymptotic  Relative  Efficiencies  Under  the  Power  Family  of  Distribu- 
tions for  the  Repeated-Measures  Problem 


V 

Dimension 

Test 

5 

2 

1 

.5 

.1 

2 

p—l 

.876 

.878 

.955 

1.266 

PRn 

.876 

.878 

.955 

1.266 

Sn 

.420 

.478 

.637 

1.216 

3 

p—l 

.943 

.927 

.971 

1.206 

PRn 

1.083 

1.012 

.985 

1.078 

8.410 

Sn 

.574 

.636 

.785 

1.234 

50.710 

4 

p—l 

.973 

.946 

.978 

1.167 

11.269 

PRn 

1.181 

1.054 

.975 

.987 

4.202 

Sn 

.660 

.719 

.849 

1.201 

20.173 

5 

p—l 

.989 

.957 

.982 

1.140 

7.418 

PRn 

1.231 

1.066 

.961 

.935 

2.786 

Sn 

.716 

.771 

.884 

1.171 

11.424 

8 

p—l 

1.012 

.976 

.989 

1.092 

3.694 

PRn 

1.273 

1.059 

.929 

.861 

1.517 

Sn 

.810 

.852 

.931 

1.115 

4.741 

20 

p—l 

1.025 

.996 

.997 

1.037 

1.707 

PRn 

1.201 

.990 

.871 

.792 

.856 

Sn 

.919 

.939 

.974 

1.049 

1.893 

Entries:  Pitman  asymptotic  relative  efficiencies  with  respect  to  Hotelling-Hsu’s  T^. 
Distribution  = Power  family. 

p-i  = New  procedure,  with  the  linear  (/»(•),  applied  to  the  X’s. 

PR^  = Peters  and  Randles  procedure  applied  to  the  X’s. 

Sji  = Randles  (2000)  procedure  applied  to  the  X’s. 
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Table  6.2:  Asymptotic  Relative  Efficiencies  Under  the  Multivariate  t-Distributions 
for  the  Repeated-Measures  Problem 


V 

Dimension 

Test 

3 

6 

10 

20 

2 

p—1 

1.900 

1.164 

1.054 

.997 

PRn 

1.900 

1.164 

1.054 

.997 

Sn 

1.621 

.879 

.757 

.690 

3 

p—1 

1.954 

1.813 

1.068 

1.011 

PRn 

1.041 

1.123 

1.005 

.985 

Sn 

2.000 

1.084 

.934 

.851 

4 

p—1 

2.010 

1.196 

1.077 

1.018 

PRn 

1.621 

1.064 

1.000 

.978 

Sn 

2.162 

1.172 

1.009 

.920 

5 

p—1 

2.055 

1.209 

1.084 

1.023 

PRn 

1.533 

1.018 

.964 

.952 

Sn 

2.250 

1.220 

1.051 

.958 

8 

p—1 

2.143 

1.235 

1.099 

1.032 

PRn 

1.386 

.930 

.892 

.894 

Sn 

2.371 

1.286 

1.107 

1.009 

20 

p—1 

2.270 

1.275 

1.121 

1.043 

PRn 

1.225 

.816 

.785 

.797 

Sn 

2.480 

1.345 

1.158 

1.056 

Entries:  Pitman  asymptotic  relative  efficiencies  with  respect  to  Hotelling-Hsu’s  T^. 
Distribution  = t-distribution. 

Win  p-1  = New  procedure,  with  the  linear  <^(-),  applied  to  the  X’s. 

PRn  = Peters  and  Randles  procedure  applied  to  the  X’s. 

Sn  = Randles  (2000)  procedure  applied  to  the  X’s. 
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Examining  Table  6.1,  it  is  clear  that  Win  p-i  has  good  efficiencies  over  a wide 
spectrum  of  distributions,  ranging  from  the  multivariate  normal  case  {v  = 1)  to 
heavy-tailed  cases  {v  = .1,  .5)  and  light-tailed  cases  {v  = 2,  5).  Its  efficiencies 
are  always  close  to  those  of  Hotelling-Hsu  and  in  heavy-tailed  cases  they  are 
considerably  better.  Moreover,  these  strong  efficiencies  are  stable  as  the  dimension 
increases,  unlike  the  efficiencies  of  PRn-  For  multivariate  normal  distributions. 

Win  p-1  is  virtually  as  good  as  Hotelling- Hsu’s  T^,  while  PRn  and  have  good 
efficiencies  only  for  small  dimensions  and  large  dimensions,  respectively.  For  the 
heavy-tailed  cases,  Sn  and  Win  p-i  have  efficiencies  that  are  always  higher  than 
one  while  those  of  PRn  are  only  so  for  the  cases  p = 2 and  p = 3.  For  the  light- 
tailed distributions,  the  efficiencies  of  Win  p-u  and  PRn  are  significantly  higher 
than  those  of  Sn,  with  PRn  being  the  best.  As  the  dimension  increases  to  oo,  the 
efficiencies  of  Win  p-i  and  approach  one  but  those  of  PRn  approach  .75.  Note 
that,  the  empty  cell  in  this  table  is  caused  by  the  fact  that  when  + p < 3,  the 
efficiencies  do  not  exist  (see  Jan  and  Randles,  1996). 

In  Table  6.2,  four  different  multivariate  t-distributions  are  considered.  These 
are  the  multivariate  t-distributions  with  degrees  of  freedom  3,  6,  10,  and  20.  The 
results  show  that  the  efficiencies  of  Win  p-i  and  Sn  are  not  only  higher  than  those 
of  PRn  but  also  those  efficiencies  are  always  higher  than  one,  except  for  those  of  Sn 
when  the  dimension  is  low  and  the  degrees  of  freedom  are  high.  Again  we  see  the 
performance  of  PRn  deteriorating  as  the  dimension  increases. 

In  summary,  one  can  conclude  that  Win  p-i  has  strong  efficiencies  for  all  cases 
considered  while  those  of  5„  and  PRn  are  only  strong  for  heavy-tailed  and  light- 
tailed distributions,  respectively.  The  efficiencies  also  demonstrate  that  VFj„  p_i 
is  more  efficient  than  for  heavy-tailed  cases  and  virtually  as  good  as  for 
multivariate  normal  distributions  and  distributions  with  lighter  tails. 
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6.3  Monte  Carlo  Results 

To  compare  the  performances  of  and  Win  p-i  with  respect  to  other  pro- 
cedures in  the  literature,  a Monte  Carlo  study  for  dimension  p = 4,  sample  size 
n = 30,  and  significance  level  a = .05  was  conducted.  Besides  Sn  and  Win  p-\i  the 
study  included  the  ANOVA  F test,  Hotelling-Hsu  T^,  Friedman’s  test  (Fk),  Iman, 
Hora,  and  Convor  (1984)  rank  transformation  version  of  the  ANOVA  F test  (Frt), 
and  Quade’s  test  (Wq).  Note  that  Quade’s  test  was  computed  using  the  range  as 
the  location-free  statistic  that  measures  the  variability  within  the  components  of 
each  observed  vector. 

Three  of  the  seven  procedures  considered  are  computed  on  the  transformed 
observations,  that  is  the  X’s.  These  are  T^,  and  Wni  p-i,  while  the  rest  of  the 
procedures  use  the  original  data,  the  Y’s.  The  test  statistics  F,  Fjjj’,  and  Wq  are 
compared  to  upper  a-th  quantile  of  an  F distribution  with  (p— 1)  and  (n— l)(p— 1), 
numerator  and  denominator  degrees  of  freedom,  respectively.  No  adjustment  for 
the  degrees  of  freedom  was  made  since  we  noticed  that  doing  so  will  only  correct 
the  a level  of  the  test  but  will  not  have  any  significant  effect  on  the  power  (see 
Jan  and  Randles,  1996).  On  the  other  hand,  the  tests  Fr,  and  Win  p-i  a.re 
compared  to  the  upper  a-th  quantile  of  a chi-square  distribution  with  p — 1 degrees 
of  freedom.  The  Hotelling-Hsu  is  compared  to  (n  — l)(p  — l)/(n  — p -1- 1)  times 
the  upper  a-th  quantile  of  an  F distribution  with  (p  — 1)  and  (n  — p -1-1),  numerator 
and  denominator  degrees  of  freedom,  respectively. 

The  test  statistics  were  compared  under  elliptically  symmetric  distributions 
with  densities  from  the  power  family  and  the  family  of  multivariate  t-distributions. 
From  the  power  family,  the  heavy-tailed  case  {u  = .1),  the  multivariate  normal 
case  {u  = 1),  and  the  light-tailed  case  {ly  = 50)  were  used.  From  the  family  of 
multivariate  f-distributions,  distributions  with  degrees  of  freedom  of  1,  3,  and 
10  were  considered.  For  test  statistics  that  use  the  original  observations,  the 
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distributions  were  located  at  r = {kS,  kS,  kS,  0),  A:  = 0, 1, 2, 3,  and  for  the  rest  of  the 
tests,  the  distributions  were  located  at  0 = {k5,  kS,  kS),  k = 0, 1, 2, 3.  The  value  5 
was  adjusted  for  different  distributions  to  obtain  somewhat  similar  points  on  the 
power  curve.  Two  different  types  of  the  population  variance-covariance  matrix  were 
considered,  the  first,  which  satisfies  Huynh  and  Feldt  condition,  is  S = I4,  the 
identity  matrix,  the  other,  which  does  not  satisfy  the  Huynh  and  Feldt  condition  is 


14  -1  -2  16 


S = EE'  = 


-1  22 

-2  1 


1 1 
3 -7 


16  1 -7  30 


(6.16) 


This  was  done  in  order  to  show  the  dependence  of  the  performances  of  F,  Fr,  and 
Frt  on  the  type  of  the  population  variance-covariance  matrix.  Tables  6.3  and  6.4 
contain  the  results  from  the  Monte  Carlo  study  done  for  distributions  from  the 
power  family,  while  Tables  6.5  and  6.6  contain  those  for  the  distributions  from  the 
family  of  t-distributions.  Entries  in  each  table  are  the  proportion  of  times  out  of 
3000  iterations  in  which  each  test  statistic  exceeded  the  upper  a-percentile  of  the 
corresponding  null  distribution. 

For  the  affine-invariant  test  statistics  considered,  that  is,  T^,  Sn,  and  p_i, 
the  results  from  the  Monte  Carlo  studies  agree  with  those  obtained  from  the 
efficiency  comparisons.  Thus,  for  the  multivariate  normal  distribution,  it  is  clear 
that  the  three  tests  behave  very  similarly,  with  having  the  highest  power.  For 
the  heavy-tailed  distribution  from  the  power  family  {v  = .1),  the  performance  of 
deteriorates  dramatically,  while  those  of  and  Win  p-i  are  still  strong  with 
being  the  better  of  the  two.  For  the  light-tailed  case  of  the  power  family  (1/  = 50), 
the  power  of  Sn  is  clearly  smaller  than  the  other  two  statistics.  Moreover,  for  the 
multivariate  t-distributions,  the  performances  of  the  three  statistics  became  very 
similar  as  the  degrees  of  freedom  increased.  While  for  small  degrees  of  freedom. 


73 


Sn  has  the  strongest  performance  followed  by  a good  performance  from  Win  p-i  ) 
while  that  of  deteriorates  significantly.  As  expected,  these  performances  did  not 
change  when  the  structure  of  the  populations  variance-covariance  matrix  changed 
from  I4  to  EE'.  This  can  be  seen  by  comparing  table  6.3  with  Table  6.4  and  Table 
6.5  with  Table  6.6. 

Comparing  all  seven  tests,  the  following  observations  are  made.  When  the 
underlying  distribution  is  multivariate  normal  with  a variance-covariance  matrix 
that  satisfies  Huynh  and  Feldt  condition  (S  = I4),  all  seven  tests  showed  a 
good  performance,  with  F being  the  best  (see.  Table  6.3,  case  u = 1).  However, 
when  the  variance-covariance  matrix  did  not  satisfy  Huynh  and  Feldt  condition 
(S  = EE'),  only  the  affine-invariant  tests  maintained  a strong  performance,  while 
all  the  other  four  tests,  showed  a significant  decrease  in  power  and  they  didn’t 
achieve  the  designed  .05  a-level  of  the  test  (see.  Table  6.4,  case  u = 1).  For  the 
heavy-tailed  distribution  from  the  power  family,  with  E = I4,  S„  and  Fr  showed 
the  highest  powers,  followed  by  Frt,  Wi„  p-i,  and  Wq,  with  a good  performance, 
while  T^,  and  F didn’t  performance  well  (see.  Table  6.3,  case  u = .1).  On  the 
other  hand,  when  E = EE',  only  Sn  kept  a strong  performance,  the  performance  of 
Fr  decreased  to  become  comparable  to  that  of  Win  p-u  while  the  rest  of  the  tests 
performed  poorly  (see.  Table  6.4,  case  u = .1).  As  for  the  light-tailed  distribution 
from  the  power  family,  with  E = I4,  Wq,  and  F performed  the  best,  followed  by 
good  performances  from  T^,  Frt,  and  VFj„  p_i,  while  Sn  and  Fr  performed  poorly 
(see.  Table  6.3,  case  = 50).  However,  when  E = EE',  T^,  and  Win  p-i  kept  the 
strongest  performances  with  Sn  performing  well,  while  all  other  tests  performed 
poorly  (see.  Table  6.4,  case  u = 50). 

Now,  results  from  Monte  Carlos  studies  for  the  family  of  f-distributions  are 
presented.  For  the  multivariate  t-distribution  with  1 degree  of  freedom  and  E = I4, 
Fr,  and  <S„  performed  the  best,  followed  by  good  performances  from  Frt,  and 
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Win  p-1)  while  the  rest  of  the  tests  performed  poorly,  especially  F,  and  (see, 
Table  6.5,  case  df=l).  On  the  other  hand,  when  E = EE',  the  performance  of  Frt 
dropped  to  below  that  of  Win  p-i  with  Sn  still  being  the  best  and  all  the  other  tests 
performing  poorly  (see.  Table  6.6,  case  df=l).  For  the  multivariate  t-distribution 
with  3 degrees  of  freedom  and  E = I4,  all  seven  tests  performed  well,  with  Frt, 

Fr,  and  Sn  being  the  best  (see.  Table  6.5,  case  df=3).  However,  when  E = EE', 
only  Sn,  Win  p-i,  and  maintained  a strong  performance,  with  a slight  advantage 
of  Sn  and  Win  p-i  over  (see.  Table  6.6,  case  df=3).  Finally,  for  the  multivariate 
t-distribution  with  10  degrees  of  freedom  and  E = I4,  all  seven  tests  have  virtually 
the  same  performance,  with  Frx  the  highest  power  (see.  Table  6.5,  case  df=10). 

On  the  other  hand,  when  E = EE',  only  T^,  5„  and  Win  p-i  maintained  a strong 
performance,  while  the  performances  of  the  other  tests  deteriorated. 

In  summary,  the  dependence  of  the  tests  F,  Fr^,  Fr,  and  Wq  on  the  structure 
of  the  variance-covariance  matrix  of  the  underlying  population  was  clear  under  all 
different  distributional  assumptions.  The  effect  was  obvious  as  the  powers  of  these 
tests  deteriorated  and  the  sizes  of  the  tests  shifted  grossly  from  the  .05  a-level. 

For  heavy-tailed  distributions,  5„,  was  the  best  overall  performance,  with  Win  p-i 
and  Fr  having  good  performances.  While  for  the  light-tailed  distributions  p_i 
and  were  always  good  choices  followed  by  Sn-  Thus,  one  can  conclude  that  the 
performance  of  Win  p-i  was  good  for  all  settingS  studied. 
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Table  6.3:  Monte  Carlo  Results  for  the  Power  Family,  with  E=l4 


Test  Statistic 

V 

Amount  of  Shift 

.1 

0 

lA 

2A 

3A 

rp2 

0.0233 

0.0620 

0.1953 

0.4100 

Sn 

0.0387 

0.2663 

0.6740 

0.8960 

p—1 

0.0417 

0.1600 

0.4443 

0.6830 

F 

0.0867 

0.0553 

0.1517 

0.3210 

Frt 

0.0477 

0.1567 

0.4957 

0.7740 

Fr 

0.0420 

0.2747 

0.6777 

0.8987 

Wq 

0.0427 

0.1070 

0.3050 

0.5597 

1 

rp2 

0.0500 

0.1353 

0.4277 

0.7813 

Sn 

0.0437 

0.1150 

0.3603 

0.7173 

p—1 

0.0470 

0.1203 

0.3790 

0.7483 

F 

0.0430 

0.1443 

0.4703 

0.8300 

Frt 

0.0477 

0.1403 

0.4570 

0.8100 

Fr 

0.0420 

0.1060 

0.3613 

0.7120 

Wq 

0.0413 

0.1233 

0.4030 

0.7727 

50 

rp2 

0.0510 

0.1360 

0.4077 

0.7847 

Sn 

0.0477 

0.0963 

0.2700 

0.5903 

p—1 

0.0480 

0.1193 

0.3600 

0.7337 

F 

0.0473 

0.1443 

0.4617 

0.8403 

Frt 

0.0510 

0.1310 

0.4060 

0.7773 

Fr 

0.0493 

0.1030 

0.2857 

0.5850 

Wq 

0.0493 

0.1470 

0.4753 

0.8523 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Power  Family. 
a = .05. 


E 

Sample  size 
Dimension 

fji2 

Sn 

I'Fin  p— 1 
F 

Frt 

Fr 

Wq 


= I4 

= 30. 

= 4. 

= Hotelling-Hsu  procedure. 

= Randles  (2000)  procedure. 

= New  procedure  with  the  linear  (/»(•)  function. 
= ANOVA  F Test. 

= Rank  Transformation  ANOVA  Test. 

= Friedman’s  Test. 

= Quade’s  Test. 


76 


Table  6.4:  Monte  Carlo  Results  for  the  Power  Family,  with  E=EE' 


Test  Statistic 

V 

Amount  of  Shift 

.1 

0 

lA 

2A 

3A 

fji2 

0.0273 

0.0697 

0.2157 

0.4350 

Sn 

0.0440 

0.2750 

0.6827 

0.9087 

p—1 

0.0433 

0.1700 

0.4487 

0.6997 

F 

0.0517 

0.0670 

0.0977 

0.1587 

Frt 

0.0707 

0.1087 

0.2217 

0.4017 

Fr 

0.0530 

0.1570 

0.4020 

0.6697 

Wq 

0.0650 

0.0983 

0.1887 

0.3080 

1 

rj^2 

0.0470 

0.1380 

0.4663 

0.8463 

Sn 

0.0433 

0.1143 

0.3910 

0.7640 

p—1 

0.0423 

0.1210 

0.4150 

0.8047 

F 

0.0670 

0.1017 

0.2147 

0.4027 

Frt 

0.0817 

0.1073 

0.1840 

0.3227 

Fr 

0.0617 

0.0910 

0.1833 

0.3823 

Wq 

0.0690 

0.1003 

0.1990 

0.3770 

50 

rp2 

0.0533 

0.1553 

0.5060 

0.8923 

Sn 

0.0433 

0.1120 

0.3410 

0.7157 

p—1 

0.0443 

0.1353 

0.4543 

0.8540 

F 

0.0743 

0.1077 

0.2330 

0.4430 

Frt 

0.0843 

0.1037 

0.1817 

0.3223 

Fr 

0.0570 

0.0837 

0.1703 

0.3377 

Wq 

0.0773 

0.1130 

0.2317 

0.4227 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Power  Family. 


a 


S 

Sample  size 
Dimension 


= .05. 
= EE', 
= 30. 
= 4. 


rp2 

Sn 

p—1 

F 

Frt 

Fr 

Wq 


= Hotelling-Hsu  procedure. 

= Randles  (2000)  procedure. 

= New  procedure  with  the  linear  function. 

= ANOVA  F Test. 

= Rank  Transformation  ANOVA  Test. 

= Friedman’s  Test. 

= Quade’s  Test. 
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Table  6.5:  Monte  Carlo  Results  for  the  Family  of  t-Distributions,  with  S=l4 


Test  Statistic 

df 

Amount  of  Shift 

1 

0 

lA 

2A 

3A 

rj-\2 

0.0183 

0.0337 

0.1043 

0.2250 

Sn 

0.0430 

0.1413 

0.4797 

0.7750 

p— 1 

0.0423 

0.1100 

0.3387 

0.5873 

F 

0.0163 

0.0237 

0.0513 

0.1000 

Frt 

0.0510 

0.1180 

0.3677 

0.6510 

Fr 

0.0507 

0.1510 

0.4987 

0.8040 

Wq 

0.0473 

0.0977 

0.2590 

0.4713 

3 

rp2 

0.0400 

0.1263 

0.4253 

0.7477 

Sn 

0.0397 

0.1507 

0.5230 

0.8603 

p— 1 

0.0390 

0.1397 

0.4750 

0.8040 

F 

0.0373 

0.1180 

0.3810 

0.7267 

Frt 

0.0443 

0.1600 

0.5440 

0.8777 

Fr 

0.0403 

0.1553 

0.5443 

0.8750 

Wq 

0.0443 

0.1213 

0.4397 

0.7650 

10 

fji2 

0.0463 

0.1150 

0.3787 

0.7150 

Sn 

0.0397 

0.1060 

0.3503 

0.6830 

p—1 

0.0413 

0.1133 

0.3507 

0.6897 

F 

0.0433 

0.1200 

0.4030 

0.7490 

Frt 

0.0493 

0.1247 

0.4207 

0.7673 

Fr 

0.0463 

0.1097 

0.3607 

0.6973 

Wq 

0.0493 

0.1093 

0.3573 

0.6903 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Multivariate  t-distribution. 
a = .05. 


S 

Sample  size 
Dimension 

rp2 

Sn 

IFjn  p_i 

F 

Fbt 

Fr 

Wq 


= I4. 

= 30. 

= 4. 

= Hotelling-Hsu  procedure 
= Randles  (2000)  procedure. 

= New  procedure  with  the  linear  (j){-)  function. 
= ANOVA  F Test. 

= Rank  Transformation  ANOVA  Test. 

= Friedman’s  Test. 

= Quade’s  Test. 


78 


Table  6.6:  Monte  Carlo  Results  for  the  Family  of  t-Distributions,  with  S=EE' 


Test  Statistic 

df 

Amount  of  Shift 

1 

0 

lA 

2A 

3A 

fji2 

0.0177 

0.0433 

0.1580 

0.3227 

Sn 

0.0397 

0.1977 

0.6210 

0.8963 

p—1 

0.0387 

0.1463 

0.4380 

0.7083 

F 

0.0347 

0.0353 

0.0440 

0.0733 

Frt 

0.0663 

0.1003 

0.1997 

0.3533 

Fr 

0.0600 

0.1167 

0.3230 

0.6090 

Wq 

0.0613 

0.0907 

0.1753 

0.3220 

3 

rji2 

0.0423 

0.1297 

0.4483 

0.7870 

Sn 

0.0407 

0.1710 

0.5630 

0.8980 

p—1 

0.0417 

0.1467 

0.5013 

0.8490 

F 

0.0647 

0.0863 

0.1700 

0.3397 

Frt 

0.0750 

0.0970 

0.2050 

0.4093 

Fr 

0.0610 

0.0953 

0.2667 

0.5523 

Wq 

0.0610 

0.0910 

0.2077 

0.4000 

10 

rji2 

0.0407 

0.2143 

0.6940 

0.9653 

Sn 

0.0410 

0.1887 

0.6510 

0.9617 

p—1 

0.0390 

0.1927 

0.6620 

0.9607 

F 

0.0717 

0.1273 

0.3153 

0.6137 

Frt 

0.0733 

0.1173 

0.2760 

0.5470 

Fr 

0.0560 

0.1117 

0.3163 

0.6573 

Wq 

0.0633 

0.1217 

0.3103 

0.5733 

Entries:  the  proportion  of  times  out  of  3000  iterations  in  which  each  test 
statistic  exceeded  the  upper  a-percentile  of  its  asymptotic  null  distribution. 
Distribution  = Multivariate  t-distribution. 

O'  = .05. 

E = EE'. 

Sample  size  = 30. 

Dimension  = 4. 


rp2 


p—1 

F 

Frt 

Fr 


= Hotelling-Hsu  procedure. 

= Randles  (2000)  procedure. 

= New  procedure  with  the  linear  0(*)  function. 
= ANOVA  F Test. 

= Rank  Transformation  ANOVA  Test. 

= Friedman’s  Test. 

= Quade’s  Test. 


CHAPTER  7 

SUMMARY,  CONCLUSIONS,  AND  FUTURE  RESEARCH 

7.1  Summary  and  Conclusions 

In  this  work,  the  multivariate  one-sample  location  problem  was  investigated.  A 

class  of  affine-invariant  test  statistics  was  introduced.  Its  small-sample  conditional 

distribution-free  property  was  established.  The  asymptotic  null  distribution  as  well 

as  the  asymptotic  distribution  under  a sequence  of  alternatives  approaching  the 

null  were  obtained  under  the  minimal  assumption  of  symmetry  of  the  underlying 

distribution  of  the  observations.  Moreover,  a particular  member  of  this  class  was 

recommended.  Comparisons  between  this  member  and  several  competitors  in  the 

« 

literature  were  based  on  the  values  of  the  corresponding  Pitman  asymptotic  relative 
efficiencies  of  these  tests  and  on  the  performances  of  these  tests  as  seen  in  several 
Monte  Carlo  studies. 

Further,  an  application  of  this  new  class  of  test  statistics  to  the  simple 
repeated-measures  designs  was  established.  The  Pitman  asymptotic  relative 
efficiencies  of  two  particular  members  were  evaluated  and  then  compared  to  those 
of  another  test  statistic  previously  proposed  for  these  designs.  More  comprehensive 
comparisons  between  these  two  members  and  several  leading  competitors  were  done 
through  Monte  Carlo  studies. 

The  comparisons  presented  show  that  in  the  one-sample  location  problem, 
the  recommended  statistic  has  strong  efficiencies  and  performs  very  well  over  a 
wide  spectrum  of  distributions,  ranging  from  very  light-tailed  distributions  to  very 
heavy-tailed  ones.  In  particular,  these  distributions  were  taken  from  two  specific 
members  of  the  class  of  elliptically  symmetric  distributions;  the  power  family  and 
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the  family  of  multivariate  t-distributions.  Moreover,  the  test’s  efficiencies  and 
performance  didn’t  deteriorate  when  the  dimension  was  increased  as  is  the  case  for 
some  of  the  other  tests  in  the  literature.  The  best  feature  of  this  statistic,  which  is 
the  ease  with  which  it  can  be  computed  for  data  in  any  practical  dimension.  This 
computational  advantage  was  seen  through  the  difference  in  computation  times  of 
Monte  Carlo  studies  when  other  test  statistics  were  included  or  excluded. 

In  regard  to  the  application  to  the  simple  repeated-measures  designs,  similar 
results  were  seen.  The  two  new  test  statistics  considered  had  strong  efficiencies  and 
performances.  Moreover,  these  performances  were  not  affected  by  the  structure 
of  the  population  variance-covariance  matrix,  as  is  the  case  for  many  of  the  test 
statistics  often  recommended  for  such  designs. 

In  summary,  one  can  conclude  that  the  new  statistic  is  not  only  efficient 
but  also  practical  to  use.  Its  affine-invariance  property,  minimal  assumption 
requirement  regarding  the  population  underlying  distribution,  strong  efficiencies, 
and  ease  of  computation  make  it  one  of  the  best  test  statistics  to  use  both  for 
one-sample  location  problems  and  simple  repeated-measures  designs. 

7.2  Future  Research 

In  the  future,  I would  like  to  investigate  other  members  of  this  proposed  class 
for  improved  efficiencies  and  performance.  Also,  other  applications  of  this  test 
statistic  will  be  researched  and  its  performance  will  be  compared  to  other  existing 


test  statistics. 


APPENDIX  A 
COMPUTING  A 


^ 

To  compute  A,  an  iterative  procedure  is  used.  It  begins  with 


S„=n-'^( 
»=1  ^ 


Y,- 


Y, 


YiM  ViiYiliy  ’ 


and  forms  Aq  = Chol(So^),  where  Chol(M)  denotes  the  upper  triangular  Cholesky 

factorization  of  the  positive  definite  matrix  M (see  Lay,  1994),  divided  by  the 

f Vi 

upper-left  element  of  that  upper  triangular  matrix.  At  the  iteration,  form 


A ^ A ^ 

Adt  = At_i  At_2  • • • Ao, 


and 


..  j A(jt  Yj 

y/it  = — for  t = 

AdtYi 


n 


i=l 


(A.1) 


If  II St  — p“^I||  is  sufficiently  small,  then  stop  and  set  A^  = A^f  If  not,  then 


compute  At  = Chol(St  and  go  back  to  step  (A.l). 
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APPENDIX  B 

UNDERLYING  DISTRIBUTIONS  AND  RELATIONS  AMONG  THEM 


As  can  be  seen  from  the  literature  review  that  the  assumption  about  the 
underlying  distribution  of  the  Y’s  varied  from  one  test  statistic  to  another.  What 
follows  is  a brief  definition  of  these  distributional  assumptions  with  an  explanation 
about  the  relationships  between  them. 

Parametric  procedures  usually  have  the  most  restrictive  assumption,  namely 
that  the  underlying  population  is  a multivariate  normal  distribution.  If  Yi,  • • • , Y„ 
come  from  a multivariate  normal  distribution  then  each  Yj  can  be  written  as 

Yi  = RiDUi  + 0,  (B.l) 

where 

• Uj’s  are  iid  uniformly  distributed  on  the  unit  p-sphere, 

• -Ri  ~ V (p)  I Pi  independent  of  the  Uj’s,  and 

• D is  a fixed  nonsingular  p x p matrix. 

The  family  of  multivariate  normal  distributions  is  a member  of  the  larger  class  of 
elliptically  symmetric  distributions.  In  this  less  restrictive  class,  the  representa- 
tion in  (B.l)  still  holds  but  now  the  RiS  are  assumed  to  be  iid  positive  random 
variables,  not  necessarily  having  a chi-square  distribution  but  still  independent 
of  the  Uj’s.  This  class  also  includes,  the  power  family,  the  family  of  multivariate 
t-distributions,  the  family  of  multivariate  logistic  distributions  and  many  more. 

Every  member  of  the  class  of  elliptically  symmetric  distributions  is  also  a 
member  of  the  class  of  distributions  with  elliptical  directions.  Distributions  in  the 
later  class  are  also  represented  as  in  (B.l),  but  only  require  the  Ri%  to  be  positive 
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numbers,  not  even  iid  or  independent  of  the  Uj’s.  Hence,  this  class  includes  skewed 
distributions  since  the  Ri's  can  be  large  in  some  directions  and  small  in  others. 

The  class  of  elliptically  symmetric  distributions  is  also  a subclass  of  the  class  of 
symmetric  distributions,  for  which 

(Yi  - 61)  = -{Yi  - e). 

Thus,  the  probability  at  a point  on  any  axis  through  6 is  the  same  as  the  probabil- 
ity at  its  reflection  through  d on  that  same  axis. 

A more  general  class  is  one  whose  distributions  are  such  that  the  probability  of 
being  at  one  side  of  any  axis  through  B is  equal  to  the  probability  of  being  at  the 
other  side  of  that  axis.  Thus,  if  Y is  a member  of  this  class  then 


Y-e  d 

Y -e\ 


Y-e 
Y-e  ■ 


Distributions  with  this  property  are  said  to  belong  to  the  class  of  directionally  sym- 
metric distributions.  It  is  clear  that  every  symmetric  distribution  is  directionally 
symmetric,  and  every  distribution  with  elliptical  directions  is  also  directionally 
symmetric. 

All  the  relationships  between  these  various  classes  are  presented  in  the  flgure 
B.l  (see  next  page). 
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Figure  B.l:  Relationships  Among  Various  Classes  of  Distributions 
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